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Careless.data 


The authorities must take the necessary time to remedy the slapdash introduction of a database 
containing the medical records of the entire population of England. 


able ability to integrate medical records into a data system that 

benefits patients and researchers alike. What a shame that it is 
making such a hash of the task, and undermining crucial public and 
professional trust in the process. 

The National Health Service (NHS) in England has postponed the 
launch of its controversial care.data programme for six months, aban- 
doning its earlier plan to start uploading data from this spring. The 
sensitive data consist of previously confidential records from people's 
consultations with their doctors, such as their family history, diagnoses, 
prescriptions and the results of blood or other tests. 

The data store, managed by the NHS’s Health and Social Care Infor- 
mation Centre (HSCIC) in Leeds, is part of an ambitious scheme to link 
patients’ doctors’ records with their hospital and other medical data, to 
create one of the world’s most comprehensive health-care databases. 
The plan is initially for the NHS to use the data to improve health- 
care management, with researchers and other users eventually being 
allowed access. 

But the programme has been controversial, raising serious concerns, 
highlighted by this journal in January (see Nature 505, 261; 2014), 
that the authorities have paid lip service to informed consent, leaving 
the public in the dark about how their personal data would be safe- 
guarded and used, and obfuscating their right to opt out. Opposition 
from patients, doctors and campaign groups swelled into open revolt in 
early February, with key bodies, such as the British Medical Association 
and the Royal College of General Practitioners, joining the chorus of 
criticism that the programme was being rushed. 

On 19 February, the NHS, which had stubbornly dismissed such 
criticisms, reluctantly admitted that its critics might have a point, and 
postponed the launch until the autumn. But the agency still seems 
to be in denial, arguing that opposition is merely down to problems 
of communication and public perception. That response is wrong. 
Care.data is far from ready for launch. 


\ ngland’s universal health-care system has potentially an envi- 


PUBLIC TRUST 
Incredibly, with just a few weeks to go to the scheduled launch, the NHS 
had not even laid out in detail which groups would be able to access the 
data, and on what terms. Thus, the public could not fully know what 
they were signing up to, raising fears that personal data might end up, 
for example, in the hands of insurance and other commercial compa- 
nies. Economic growth is a core goal of any UK government, but there 
has to be confidence that this won't trample on the rights of individuals. 
Adding to the mess, over the past week it has emerged that the Leeds 
centre has handed over records from its existing massive database of 
personal hospital data to outside groups in ways that might have con- 
travened its own rules. At the very least, these episodes raise questions 
about the functioning of the HSCIC, but they also risk diminishing 
what little public confidence is left in care.data. 


Moreover, at a time when there has never been a greater need for the 
operators of care.data to win public trust, the HSCIC has stonewalled 
about explaining the terms and conditions of the data transfers made. 
It must quickly disclose full details of the transfers, and of any past 
violations. 

Jeremy Hunt, the UK health secretary, seems to have grasped that the 
row over care.data is not just a public-relations hitch, and that the hap- 
hazard planning puts the scheme at risk of going into meltdown, with 
the loss of many of its benefits. Last Friday, he announced his intention 

to legislate to prevent the HSCIC from releas- 


“Care.data ing pseudonymized data — in which patients’ 


needs to be NHS numbers are replaced by pseudonyms, 
thought about but where people can often be re-identified 
more deeply and  —unless there were clear health benefits, and 
recrafted, fully to ban the release of data for commercial pur- 
engaging all poses. He also said that the HSCIC would be 


subject to new external statutory oversight. 

Hunt’s move is a step in the right direc- 
tion, but the programme needs to be thought about more deeply and 
recrafted, with the necessary time given to engage all stakeholders more 
fully, including researchers and opponents of the scheme. 

Whatever laws are introduced in England, they will be subject to 
an upcoming European Union (EU) General Data Protection Regula- 
tion, which will be legally binding on member states and so override 
national laws. The EU law, a major revision of 1995 legislation, also 
governs the use of medical records, and last October, the European 
Parliament’s Committee on Civil Liberties, Justice and Home Affairs 
voted for amendments that would remove many of the exemptions for 
medical research. 

Although anonymized data could be used without restriction, one 
amendment would allow the use of pseudonymized data without con- 
sent only if the research was of “high public interest” and could not 
be carried out otherwise, with requests needing to be reviewed by an 
independent body. A joint statement in January by dozens of Euro- 
pean research organizations and charities warned that medical research 
would be “severely threatened” by this, and pointed out that the use of 
such data is already subject to strong institutional ethical oversight. 

Although the use of personal data for research, with the attendant 
high ethical standards, is probably the least of the worries, scientists 
must be sensitive to EU public concerns about the collection and use of 
health data more broadly. As always, democratic scrutiny, and striking 
a balance between regulatory overheads and genuine concerns, will 
be key. Again, it will be important to take the time to have the debate 
needed to get this balance right. 

Itis a fundamental human right that people can determine how their 
personal medical data are used, and exceptions to specific informed 
consent cannot be taken for granted. Informed consent is not an obstacle 
to be overcome, but a principle to be respected and cherished. m 


stakeholders.” 
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Full disclosure 


Regulatory agencies must demand conflict - of- 
interest statements for the research they use. 


the Watergate political scandal by two Washington Post reporters, 

that popularized the phrase: “Follow the money.’ He who pays the 
piper calls the tune. Science combats the undue influence of com- 
mercial interests — or at least tries to — by using a different guideline, 
illustrated by a popular catchphrase from another film: “Show me the 
money.’ Give us transparency. 

The selective promotion of scientific research to steer policy-making 
isa murkier business altogether — particularly in environmental policy- 
making, in which the battle for the ear of the piper between big business 
and the ‘little guy; who is often affected by pollution or hazardous sub- 
stances, is so asymmetric. The problem is not limited to climate change, 
which is only the most high-profile example at present. 

It has been more than a decade, for example, since David Michaels, 
previously a public-health researcher at George Washington Univer- 
sity in Washington DC, and Wendy Wagner, an environmental-law 
specialist, broached the issue in the pages of Science (D. Michaels and 
W. Wagner Science 302, 2073; 2003). They warned that the evidence 
base of important regulatory standards is undermined by the limited 
scrutiny of private research submitted to regulatory bodies, and by the 
fact that these bodies often do not require disclosure of researchers 
funding sources. 

Michaels is now in a position to do something about this. In 2009, he 
was appointed to lead the Occupational Safety and Health Administra- 
tion (OSHA), one of the US agencies he criticized in that 2003 piece. 

OSHAs remit is health-and-safety standards, and the test bed 
for Michaels’ stance is a 40-year effort to regulate exposure to silica 
dust. Crystalline silica dust is produced by processes such as con- 
crete grinding and sandblasting in construction and other indus- 
tries. If inhaled, it can cause silicosis — an incurable condition 


I twas the 1976 film All the President’ Men, about the uncovering of 


involving inflammation of the lungs — and lung cancer. 

As part of a consultation on tougher regulation of silica exposure, 
OSHA asked that people submitting scientific comments to the agency 
should declare financial conflicts ofinterest. According to Michaels, this 
might be the first time that any federal agency has made such a request. 

But even though this is a request and not a requirement, it has not 
gone down well in all quarters. In particular, a group of powerful US 
senators has come out against the idea that such a declaration should 

be part of federal rule-making (see page 18). 


“There isa They suggest that OSHA might “prejudge 
broad consensus _ the substance” of comments on the basis of 
in favour of such disclosures. 

transparency Nature — like many journals — has 
about ‘funding required such disclosures for years, and con- 
sources.” siders such opposition to be misguided. In 


controversial areas, these conflict statements 
pre-empt allegations of secrecy and bias that could distract from the 
central issues. And past failure to be transparent about such interests 
has led to scandals involving concealed or distorted evidence and ghost- 
writing, as has been well documented in areas from tobacco control to 
drug development. 

The medical profession and the pharmaceutical industry, to their 
credit, have taken major steps towards openness. Some researchers 
think that conflict-of-interest disclosures should go even further than 
they currently do, and should detail the contractual arrangements 
involved, such as whether the funder had a veto on publication. In 
science more generally, there is a broad consensus in favour of trans- 
parency about funding sources. 

Transparency is the best defence against the purchase of undue 
influence by those with the most financial clout. In areas where 
tough standards are needed to protect public health, and powerful 
and wealthy interests have a financial incentive to water down these 
standards, such transparency is more than desirable — it is essential, 
and history demonstrates that. Rather than challenging OSHA for 
requesting conflict-of-interest disclosures, US politicians should be 
asking why all federal agencies do not require them. After all, it is 
easier to the follow the money, and to make the proper decision, when 
all details are on full show. m 


Track and trace 


Identifiers that follow researchers’ work from 
grant to paper will make funding more effective. 


ore than half a million researchers have now signed up for 
M an online science passport: a unique 16-digit identity num- 

ber, with an accompanying online profile, from the Open 
Researcher and Contributor ID (ORCID) project. There, researchers 
can maintain an up-to-date record of their professional pursuits. 

Already, ORCID is being integrated into the ecosystem of science: 
many publishers accept ORCID identifiers in their manuscript- 
submission processes, and funders including the Wellcome Trust 
and the US National Institutes of Health are accepting the identifiers 
to streamline grant applications. Universities and research institu- 
tions are planning to use the system to track their researchers’ output 
throughout their careers. 

So far, the ORCID website has prompted scientists to record outputs 
such as articles, data sets, citations, patents and media appearances. 
This fits in with the growing desire of institutions and funding agen- 
cies to recognize the full range of researchers’ activities and impacts. 

But this week, ORCID begins to request a new set of data — inputs. 
Researchers logging in to their profiles will be prompted to add the 
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details of their grants, or to confirm information on grants they hold. 
Such information is often publicly available on the Internet, but scattered 
across funding-agency websites, rather than collated for individual sci- 
entists. ORCID hopes to improve tracking of the connections between 
the cash that funders pour into research and the results that emerge. 

Another service that makes it easier to link grants in with papers 
out is FundRef, launched last year by the non-profit publisher alliance 
CrossRef. It provides a standardized format for adding funding infor- 
mation to the metadata of research articles published online. 

The result — if such systems catch on — should be easier tracking 
of the efficiency of the science system. Which academics produce the 
most for the grants they receive, and why? What kinds of grants are 
most effective at prompting what types of output? That is something 
funders and economists would dearly like to know. They have made 
individual efforts, but a bigger-picture understanding has been held 
back by lack of connectivity across agencies. 

There is perhaps a danger that scientists — so used to measuring 
the properties of others — will be resistant to having information 
recorded on themselves. (Less than one-quarter of researchers 
signed up to ORCID have actually listed at least one output on 
their profiles.) But ORCID (of which Nature Publishing Group is 
a partner) gives researchers control over the 
information that they allow to be publicly vis- 
ible. Hopefully, they will embrace the oppor- 
tunity to make science funding more effective 
and evidence-based. m 
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WORLD VIEW .ennisicossen 


newspapers. When I later ran the text through Google Translate, 
I got a shock: I was referred to repeatedly as “he”. 

Like much science and technology, Google Translate has a male 
default. When I drive a car, the seatbelt is not designed to accommo- 
date breast tissue. Any medicines I take are more likely to have been 
tested on male than on female animals. There are moral issues here: 
women pay taxes and buy products and should not be short-changed. 
But scientific objectivity is at stake, too. Because medical research is 
done mainly in males, there is a male bias in, for example, the choice 
of drug targets. Science is halving the potential field of innovation. 

This is not about active discrimination; the bias is largely uncon- 
scious. Google Translate defaults to the masculine pronoun because 
‘he’ is more commonly found on the Web than 
‘she. Yet that is changing: an analysis of American- 
English texts in Google Books shows that the ratio 
of masculine to feminine pronouns has fallen to 
around 2:1, from a peak of 4:1 in the 1960s. 

In the summer of 2012, I invited Google and 
several language-processing experts to a Gen- 
dered Innovations workshop at Harvard Univer- 
sity in Cambridge, Massachusetts. They listened 
to the problem for about 20 minutes, then said: 
“We can fix that!” Although it is complicated, the 
search for solutions is on. Fixing the problem is 
great, but constantly retrofitting for women is not 
the best road forwards. 

A better way is to include gender at all relevant 
phases of research — when setting priorities, 
gathering and analysing data, evaluating results, 
developing patents and, finally, transferring ideas 
to markets. Science and technology should take into account the bio- 
logical and social needs of both women and men. 

Unconscious sex and gender bias can be socially harmful and expen- 
sive. In automotive engineering, short people (many women, but also 
many men) are classed as ‘out-of-positio drivers and often ignored. 
This leads to greater injury in accidents. In medicine, osteoporosis has 
long been defined as a disease mainly of post-menopausal women — 
an assumption that has shaped screening, diagnosis and treatment. Yet 
after the age of 75, men account for nearly one-third of osteoporosis- 
related hip fractures. And in basic biomedical research, the failure to 
use female cells, tissues and animals can lead to greater health risks for 
women. Of the ten drugs withdrawn from the US market between 1997 
and 2000, eight posed greater threats to women than to men. Develop- 
ing a drug costs billions of dollars, and failure can 


E Madrid a couple of years ago, I was interviewed for Spanish 


cause human suffering and death — with stakes DNATURE.COM 

this high, why ignore half of the population? Discuss this article 
In December last year, the European Com- online at: 

mission took a historic step towards ending this _go.natuire.com/nGpglw 


INCLUDING 


ANALYSIS IN 
RESEARCH CAN SAVE 
US FROM 


LIFE- 


THREATENING 
ERRORS. 


Scientific research must 
take gender into account 


From car design to drug discovery, the failure to acknowledge sex differences 
can be costly and even lethal, argues Londa Schiebinger. 


unconscious bias. Applicants to its newly opened Horizon 2020 fund- 
ing scheme are nowasked to include gender analysis in their projects 
— for example, to assess whether the research will have different 
implications for women and men. The commission identified dozens 
of science areas that could benefit from gender analysis: computer 
hardware and architecture, biodiversity, ecology, biophysics, ocean- 
ography, geosciences, organic chemistry, aeronautics, space medicine 
and some 40 others, including nanotechnology (astrophysics did not 
make the cut). 

Since 2006, the Canadian Institutes of Health Research has required 
scientists across its 13 institutes to analyse sex and gender (when 
appropriate); and since 2008, the philanthropic Bill & Melinda Gates 
Foundation has required gender analysis for its agricultural research 
grants. The European Commission’s move with 
Horizon 2020 is the most significant in terms 
of scope. 

Resistance sometimes comes from those who 
ask: won't including females in animal studies 
increase costs? For an individual laboratory, 
it probably will. But removing bias from sci- 
ence will cost society less in the long run — and 
save lives. How can we safely include women in 
phase III human trials (as required by US law 
since 1993) if drugs are not first tested in female 
laboratory animals? 

Many of these problems have been discussed 
before. But there are signs that more people are 
taking the issue of gender analysis seriously. Pub- 
lishers, for example, are catching on and insisting 
that published work accounts for gender. Clinical 
Orthopaedics and Related Research recommends 
that studies be sufficiently powered to analyse sex and gender, and 
in 2012, each of the American Physiological Society’s 14 journals 
required that authors report and analyse sex effects. More journals 
should follow. 

Including gender analysis in research can save us from life-threat- 
ening errors... and can lead to new discoveries. Gender analysis has 
led to better treatments for heart disease in women. Identifying the 
genetic mechanisms of ovarian determination has enhanced knowl- 
edge about testis development. Analysing how sex affects donor- 
recipient matching is improving stem-cell therapies. And exploring 
how sex-specific biological factors and gender-specific behaviours 
interact has helped researchers to understand how nutrients trigger 
cell functions, and may assist in the fight against obesity. 

Can we afford to ignore such opportunities? = 


Londa Schiebinger is the John L. Hinds professor of history of science 
at Stanford University, California, USA. 
e-mail: schieb@stanford.edu 
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RESEARCH HIGHLIGHT 


How cancer skirts 
brain defences 


Proteins that block cell death 
and help cells to integrate with 
blood vessels are crucial in the 
spread of cancer to the brain. 

Brain metastasis is often 
deadly, but most cancer 
cells that invade the brain 
die without establishing a 
tumour. To find ways in which 
successful invaders bypass 
the brain’s defences, Joan 
Massagué of the Memorial 
Sloan Kettering Cancer Center 
in New York and his colleagues 
looked at a set of genes 
expressed in brain metastases, 
focusing on two proteins called 
serpins. 

These serpins inhibit 
another protein called 
plasminogen activator, which, 
the authors found, kills cancer 
cells in the brain. Serpin 
expression shielded cancer 
cells from cell death and helped 
them to spread on the surface 
of capillaries, establishing 
ablood supply in their new 
home. 

Cell 156, 1002-1016 (2014) 


Changing colour 
under pressure 


Materials that change colour 
when pulled or squeezed 
could form the basis of display 
screens or sensors. But existing 
photonic gels, which change 
colour when deformed, cannot 
cover the entire rainbow or 
switch quickly. 
Now, Jianping Ge 
at East China 
Normal 
University in 
Shanghai and 
his colleagues 
have created a 
material that 
can adopt every 
colour from red 


Selections from the 
scientific literature 


Ancient cheese found with mummies 


The oldest known pieces of cheese have turned 
up in the tombs ofan early Bronze Age cemetery 


in Xinjiang, China. 


Andrej Shevchenko at the Max Planck 
Institute of Molecular and Cell Biology and 
Genetics in Dresden, Germany, Changsui 
Wang at the University of Chinese Academy 
of Sciences in Beijing, and their colleagues 
analysed 3,800-year-old lumps found at the 
neck and chest of mummies (pictured) in the 
cemetery and identified them as a ‘kefir’ cheese. 

This type of cheese is made by curdling 


to blue and is more sensitive to 
pressure than previous efforts, 
while being just as durable. The 
team used a mixture of ethylene 
glycol and polyethylene glycol 
methacrylate in which silica 
spheres were fixed to create 

a softer and more elastic gel 
(pictured). Like other photonic 
gels, the new material works 
because it forms a crystalline 
array of spheres, which alter 
light reflection by shifting 
orientation when squeezed. 
Adv. Funct. Mater. http://doi.org/ 
rpn (2014) 
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ruminant milk with a symbiotic culture 
of bacteria, including Lactobacillus 


kefiranofaciens, and yeast. Evidence of a kefir 


shards. 


Genes make 
bacteria magnetic 


Researchers have transferred 
genes for the production of 
magnetic nanocrystals from 
one species of bacteria to 
another, a step towards making 
bacterial bioreactors that 
generate such particles. 

Dirk Schueler at Ludwig- 
Maximilians University 
in Munich, Germany, 
Youming Zhang at the 
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dairy — which makes lactose-free products — 
in this region explains why large-scale ruminant 
herding and milking spread in a population 
known to have been lactose intolerant, the 
authors say. The origin of cheese making dates 
back some 4,000 years earlier, but evidence for 
this has relied on analysis of milk fat in pottery 


J. Arch. Sci. http://doi.org/rpq (2014) 


Helmholtz Joint Institute 

at Shandong University in 
Jinan, China, and their team 
focused on roughly 30 genes 
from Magnetospirillum 
gryphiswaldense that 

enable the bacterium to 
produce membrane-bound, 
iron-based magnetic 
nanocrystals. Researchers 
inserted these genes into 
Rhodospirillum rubrum, a 
well-studied organism used in 
biotechnology, that is easier to 
work with. The authors found 
that both sets of magnetic 


YUSHENG LIU 


DONGPENG YANG 


ADAM METALLO/SMITHSONIAN INST. 


particles were similar in size, 
structure and composition. 
Nature Nanotech. http://doi.org/ 
rpg (2014) 


Stubborn microbe 
finds hiding spots 


Salmonella bacteria can escape 
antibiotics and immune- 
system attack by hiding inside 
ahost’s immune cells. 

Roland Regoes and 
Wolf-Dietrich Hardt of the 
Swiss Federal Institute of 
Technology in Zurich and 
their colleagues infected mice 
with Salmonella enterica and 
then treated the animals with 
the antibiotic ciprofloxacin. 
The team found that, after 
most of the infection had 
cleared from organs, about 
10% of the Salmonella bacteria 
inside lymph nodes that drain 
the intestines were still viable 
and growing. These bacteria 
re-established infection after 
antibiotic treatment. 

Ina separate study, Sophie 
Helaine, David Holden, and 
their colleagues at Imperial 
College London found that 
Salmonella cells can also 
persist inside macrophages 
that ingest them. Molecules 
that stimulate certain types 
of immune cells could, in 
combination with antibiotics, 
improve treatment. 

PLoS Biol. 12,e1001793 (2014); 
Science 343, 204-208 (2014) 


Worms bond to 
reach new heights 


To reach passing beetles, 
Parasitic worms congregate 
into towers up to 30 times taller 
than an individual. 
Hans-Joachim Kndélker 
at the Dresden University 
of Technology in Germany, 
Teymuras Kurzchalia at 
the Max Planck Institute of 
Molecular Cell Biology and 
Genetics in Dresden, and 
their team found that juvenile 
Pristionchus pacificus secrete 
a waxy substance, dubbed 
nematoil by the authors, that 
functions as an adhesive 


between animals. This allows 
them to form towers with up to 
1,000 individuals, reaching as 
high as 1 centimetre, which can 
attach to a host. 

Nature Chem. Bio. http://doi.org/ 
rqt (2014) 


MATERIALS 


Waste glass finds 
diffuse use 


A by-product of antiquated 
glass-making processes could 
find a new lease of life as an 
optical diffuser. 
Manufacturers now avoid 
producing crystals, called 
devitrite, in their soda—lime- 
silica glass because they 
degrade the material's optical 
qualities. But Haider Butt at 
the University of Birmingham, 
UK, and his team show that 
this currently undesirable form 
of glass can act as a diffuser, 
owing to the fact that needle- 
like crystals of devitrite scatter 
light to wide angles of up to 
120°. Because devitrite is both 
a highly efficient diffuser 
and cheap to produce, the 
authors say it could be used in 
applications such as medical 
lasers. 
ACS Nano. hitp://doi.org/rpk 
(2014) 


PALAEONTOLOGY 


Algae dealt blow 
to ancient whales 


The unearthing of more than 
40 marine mammal fossils 
(pictured) at a site in Atacama, 
Chile, has revealed that they 
probably died en masse in four 
events due to toxic algae. 
Nicholas Pyenson of the 
Smithsonian Institution in 
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Hopping DNA linked to schizophrenia 


€ HIGHLY READ 


on www.cell.com 
27 Jan-26 Feb 


Mobile DNA elements activated by 
environmental or genetic triggers could 
boost susceptibility to schizophrenia. 


Tadafumi Kato at the RIKEN Brain 
Science Institute in Saitama, Japan, Kazuya Iwamoto at the 
University of Tokyo and their colleagues found higher copy 
numbers of a genetic element, called the L1 retrotransposon, 
in the DNA extracted from the post-mortem brains of patients 
with schizophrenia compared with that from the brains of 
healthy individuals. The authors also showed that the L1 
elements tended to localize to genes linked to schizophrenia 


and neuronal synapses. 


The same phenomenon was observed in mice and monkeys 
exposed perinatally to chemical stressors that are known 
to promote schizophrenia-like behaviours, and in cultured 
neurons derived from stem cells carrying a schizophrenia- 


related chromosomal deletion. 


Neuron 81, 306-313 (2014) 


Washington DC and his 
colleagues found that the 
fossils, including baleen and 
other whales, seals and an 
aquatic sloth, were arranged 
in four distinct layers, which 
are between 9 million and 
6.5 million years old. The 
only modern event known 
to trigger such recurring and 
rapid die-offs of multiple 
species is extreme growth 
of toxic algae. The animals 
could have died after eating 
contaminated prey or have 
been directly affected by the 
algae’s toxin. Other ocean 
areas that once created 

such blooms could harbour 
substantial fossil vertebrate 
remains, the authors say. 
Proc. R. Soc. B 281, 20133316 
(2014) 


Pheromone turns 
on goat brains 


Researchers have pinpointed 
amolecule produced by 
male goats that activates 
reproduction in females 
outside of their normal 
breeding season. 

Yukari Takeuchi at the 
University of Tokyo and her 
colleagues used a special cap to 
capture 18 different pheromone 
molecules emitted from the 
heads of male goats. They 
then exposed females to the 
molecules, and used implanted 
electrodes to monitor activity 
ina brain region that regulates 
reproduction. One molecule, 
4-ethyloctanal, triggered the 
biggest response of all the 
compounds tested. 

This is the first molecule 
shown to stimulate a key 
regulator of reproduction 
in mammals called the 
gonadotropin releasing 
hormone pulse generator. 

Curr. Biol. http://doi.org/rqv (2014) 
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SEVEN DAYS nescnns 


Research relegation 


Switzerland-based scientists 
and companies will have 
restricted access to European 
research funds in 2014. 
Switzerland, which is not a 
member of the European 
Union (EU), has now lost its 
status as an associate partner 
in the EU’s €80-billion 
(US$110-billion) Horizon 
2020 funding programme, 
after it imposed curbs on 
immigration last month (see 
Nature 506, 277; 2014). The 
relegation means that research 
institutes and companies based 
in Switzerland are no longer 
automatically eligible to apply 
for EU funding. Moreover, 
individual scientists cannot use 
basic-research grants from the 
European Research Council at 
Swiss institutes. 


US Navy looks north 
By 2030, the Northern Sea 
Route — a shipping lane 
running through the Arctic 

— could be ice-free and 
navigable for at least nine 
weeks each year, the US 

Navy said on 24 February 

in its ‘Arctic roadmap’. The 
Northwest Passage — an 
alternative shipping route 
through the Arctic — could be 
open five weeks each year, the 
roadmap says. It also projects 
that the region’s waterways 
will see rising activity from 
fishing, tourism, and oil and 
gas exploration, but says the 
development poses little risk 
to US national security. See 
go.nature.com/fv9skt for more. 


DNA swap 

The United Kingdom could 
become the first country 

to legalize mitochondrial 
replacement — a reproductive 
technology that produces 
offspring with three genetic 
parents. The technique 

could prevent children from 
inheriting diseases that 


Snow satellite launch 


A joint US-Japanese mission to monitor rain 
and snow launched from the Tanegashima Space 
Center in Japan on 27 February. The US$933- 
million Global Precipitation Measurement Core 
Observatory (pictured) will map precipitation 
across 90% of the globe every three hours. 


affect mitochondria, the 
cell’s energy producers, by 
transplanting nuclear genetic 
material to a donor cell with 
healthy mitochondria. A 
public consultation on the 
draft legislation is open 
until 21 May. In the United 
States, a Food and Drug 
Administration advisory 
panel examined the science 
behind the technologies on 


25-26 February. See go.nature. 


com/ho8haz for more. 


ITER critique 

The governing council of 
ITER, a multibillion-euro 
international nuclear-fusion 
experiment, has approved 

11 urgent reforms to the 
project's management. The 
executive summary of a critical 
assessment report considered 
at the council’s 13 February 
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meeting was leaked online 

last week by The New Yorker 
magazine (see go.nature. 
com/7ayyqe). It details 
long-standing problems with 
ITER’s complex international 
management (see Nature http:// 
doi.org/nwgq; 2012), but is more 
scathing than past assessments, 
calling for an “achievable and 
realistic schedule” It adds 

that the “schedule has slipped 
so consistently ... almost no 
one ... believes the announced 
dates for first plasma [2027] 

or other key milestones will 

be met”. 


Leaky helmet 

An incident that nearly 
drowned an astronaut on 

a spacewalk last July could 
have been avoided, says a 
NASA report published on 
26 February. Italian astronaut 
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The satellite was built by NASA and the Japan 
Aerospace Exploration Agency (JAXA), and 
the data it will collect could aid the accuracy 
of forecasts of severe weather, estimates of 
freshwater supplies and projections of climate 
change. See go.nature.com/rf9k5s for more. 


Luca Parmitano had to feel his 
way back to the International 
Space Station airlock as more 
than 1 litre of water collected 
in his helmet, covering his 
eyes and nose. The water came 
in through a leak in his suit. 

A week earlier, the same leak 
had occurred, but the crew 
mistakenly thought that the 
water came from his drinking 
tube. The ‘close-call’ spacewalk 
would have been postponed 

if astronauts aboard the space 
station had fully investigated 
the first incident. 


Radiation leak 

The US Department of Energy 
reported on 26 February 

that 13 employees had tested 
positive for low-level radiation 
exposure following a leak 

at its Waste Isolation Pilot 
Plant near Carlsbad, New 


NASA/JAXA 


/A DEPARTMENT OF CULTURAL AFFAIRS 


= 
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Mexico. The employees were 
above ground at the time 

of the 14 February leak, but 
preliminary tests suggest 
that they inhaled radioactive 
particles. Energy department 
officials said the health and 
environmental impacts of 
the leak seem to be minor, 
but traces of plutonium and 
americium were detected as 
far as 1 kilometre from the site. 
See page 15 for more on US 
nuclear waste storage. 


eee PEOPLE en 
Final verdict 


The South Korean Supreme 
Court upheld on 27 February 
a 2010 ruling that sentenced 
disgraced cloning researcher 
Woo Suk Hwang to an 
18-month prison term for 
embezzlement and violation 
of the country’s bioethics law. 
But Hwang will serve jail time 
only if he breaks laws during 
a two-year probation period. 
The court also annulled an 
earlier decision that would 
have forced Seoul National 
University to reinstate Hwang. 
This is the final judgment in 

a trial that started in 2006. 

See go.nature.com/tp7tq9 for 
more. 


Borlaug bronze 

A bronze statue of Norman 
Borlaug (pictured), the 
plant scientist who won the 
Nobel Peace Prize in 1970 
for developing high-yielding 


TREND WATCH 


Biotech firms are increasingly 
seeking partnerships with 
European academic institutes 
and funders, with 314 publicly 
disclosed links in 2013. The 
European Commission and 
Cancer Research UK made 

the most links, according to an 
analysis by the publication SciBX. 
Among companies, AstraZeneca 
had the most public-private 
partnerships, including its deal 
with Sweden's Karolinska Institute 
to create a US$100-million 
cardiac-research centre. See 
go.nature.com/2fwpsq for more. 


wheat crops, will be unveiled 
at the US Capitol building in 
Washington DC on 25 March. 
The state of lowa — where 
Borlaug was born — will install 
the statue, which depicts him 
taking field notes, to mark the 
100th anniversary of his birth. 
Each US state is allowed to 
place two statues of prominent 
citizens in the National 
Statuary Hall Collection. 


PUBLISHING 


Gibberish papers 
The publisher Springer 

will remove 16 computer- 
generated nonsense papers 
that it had published in its 
subscription database, it said 
last week. The papers, created 
by a computer program called 
SClgen, were spotted by 
computer scientist Cyril Labbé 
of Joseph Fourier University in 
Grenoble, France. Last week, 
Nature revealed that Labbé had 
privately informed Springer 

of the problem. He had also 


contacted the US Institute 

of Electrical and Electronic 
Engineers (IEEE), which has 
already withdrawn more than 
100 nonsense articles. The 
papers had been included in 
conference proceedings, some 
of which were supposed to be 
peer-reviewed. See go.nature. 
com/d7h4sr for more. 


Data sharing 

The Public Library of 

Science (PLoS), an open- 
access publisher based in 

San Francisco, California, 

has strengthened its policy 
requiring authors to make 
data publicly available. 

From 3 March, researchers 
publishing in PLoS journals 
must share the data underlying 
their findings as soon as their 
manuscript is published and 
must do so on the Internet, 
for example in public data 
repositories. Some exceptions 
are allowed, for instance in 
cases in which patient privacy 
would be compromised. 


| RESEARCH 
Ageing data 


The US National Institutes 

of Health (NIH) announced 
on 26 February that it had 
added genetic and medical 
data from more than 78,000 
people with an average age 

of 63 to an online genetics 
database, as part of its Genetic 
Epidemiology Research on 
Aging project. The data, 


PUBLIC-PRIVATE PARTNERSHIPS SOAR IN EUROPE 


Links between drug companies and funders or institutions are 
being forged in Europe more than in the United States. 
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Data include double counting because some partnerships involve 
companies and/or institutions from more than one region. 


SEVEN DAYS | THIS WEEK | 


10-11 MARCH 

The Royal Society in 
London hosts a meeting 
on phylogeny, extinction 
risks and conservation to 
discuss how information 
on evolutionary 
relationships can 

be combined with 
metrics such as lists 

of endangered species 

to assess conservation 
needs. 
go.nature.com/pwwz8i 


11 MARCH 

The US Department 

of Agriculture ends 

its extended public 
comment period on 
whether it should allow 
sales of maize (corn) and 
soya bean seeds that are 
genetically engineered 
to resist herbicides such 
as 2,4-D. 


collected with US$25 million 
of NIH funding, will enable 
genome-wide association 
studies that will search for 
genetic signatures linked toa 
variety of diseases, including 
those associated with ageing. 


Repatriated bones 


Two German universities this 
week handed over 32 human 
skulls and three skeletons of 
Namibian origin to Namibian 
officials. The bones derive 
from tribe members who were 
killed by German troops or 
who died in concentration 
camps in the early twentieth 
century when Namibia was 

a German colony. Scientists 
used them for now-discredited 
research to determine 
anatomical differences 
between races. Researchers 
identified the remains among 
thousands of bones in historic 
anthropological collections at 
the University of Freiburg and 
the Charité Medical University 
in Berlin. 


> NATURE.COM 
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www.nature.com/news 


6 MARCH 2014 | VOL 507 | NATURE | 13 


© 2014 Macmillan Publishers Limited. All rights reserved 


ASSOCIATED PRESS 


NEWSIN FOCUS 


Three years New prenatal screens Brazil cracks 
after Fukushima, Japan is at may mean hard choices for down on illegal fossil trade, 
nuclear crossroads p.16 expectant parents p.19 


Researchers 
strive to kick the lithium-ion 


rekindling export debates p.20 | battery from its throne p.26 


NUCLEAR POLICY 


US seeks waste- 
research revival 


Radioactive leak brings nuclear repositories into the spotlight. 


BY JEFF TOLLEFSON 


radiation leak has raised questions 
A= the safety of the United States’ 

only deep nuclear-waste repository, 
and has given fresh voice to scientists calling 
for more research into underground waste 
storage. 

On 14 February, radioactive plutonium and 
americium leaked out of the Waste Isolation 
Pilot Plant (WIPP) near Carlsbad, New 
Mexico, where thousands of drums of con- 
taminated material from the US nuclear-weap- 
ons programme are stored in salt beds more 


than half a kilometre below the surface. The 
health and environmental impacts seem to be 
minor, but 13 employees have tested positive 
for low-level contamination. The Department 
of Energy (DOE) and its contractors are still 
working on a plan to re-enter the WIPP and 
find out what caused the leak. 

The incident also brings renewed atten- 
tion to a problem that policy-makers have 
been avoiding: what to do with a mount- 
ing stockpile of spent fuel from commercial 
reactors, which is currently stored at reactor 
sites. In 2010, the DOE mothballed plans to 
develop Yucca Mountain in Nevada, which 
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since 1987 had been designated as the future 
site of an underground repository (see Nature 
473, 266-267; 2011). Researchers at the DOE 
and universities want to explore a variety of 
alternatives. But they say that they have been 
hobbled by small budgets and the Nuclear 
Waste Policy Act, which prevents the DOE 
from investigating any specific site apart from 
Yucca Mountain. 

“Basically, all of the old ideas have come 
back out of the woodwork,” says Michael 
Driscoll, a nuclear engineer at the Massachu- 
setts Institute of Technology in Cambridge. 
“But the first thing we need is Congress to 
wrestle with this and revise the Nuclear Waste 
Policy Act? 

For now, researchers are pursuing generic 
repository science that does not conflict with 
the law. In one large proposed experiment, 
DOE scientists wanted to assess whether salt 
beds at the WIPP could store radioactive waste 
that is hotter than the material they currently 
hold. In 2011, the team began developing a 
US$31-million experiment that would have 
tested how the salt deforms when it is heated, 
and how water moves through it. 

Other researchers are investigating the 
concept of dropping cylinders of nuclear 
waste into 5-kilometre-deep boreholes in 
hard rock such as granite. Sandia National 
Laboratories in Albuquerque, New Mexico 
is leading a consortium of researchers and 
companies seeking to drill an experimental 
borehole costing approximately $25 million. 
The hot-salt and borehole proposals are now 
competing for funding within the DOE’ rel- 
atively small $15-million annual budget for 
this kind of research. “Big tests like either of 
those would completely overwhelm the cur- 
rent budget,” says Peter Swift, who heads the 
DOE's nuclear-waste science programme at 
Sandia. 

In Europe, scientists have developed exper- 
tise with other types of rock. Finland and 
France have homed in on proposed under- 
ground repositories in granite and shale, 
respectively. Germany has buried low- and 
medium-level wastes in underground domes 
of salt, and it is evaluating the terrain for a 
controversial high-level waste repository. 

International collaboration gives research- 
ers access to the basic science on all of these 
environments, says Jacques Delay, secretary- 
general of the Implementing Geological Dis- 
posal of Radioactive Waste Technology 
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> Platform in Bure, France, a consortium 
that guides a roughly €10-million (US$14- 
million) joint research programme under 
the European Commission. “What is tricky 
is to make the link between the academic 
science and our projects,” he says. 

But basic research can go only so far, 
because the scientific assessment of reposi- 
tory safety is specific to local geology. After 
choosing a site, researchers must study 
the density, porosity and heat conduct- 
ance of the rock there, and characterize 
any fractures and groundwater movement. 
Modelling and experiments help to deter- 
mine how the rock will respond to the heat 
generated by the nuclear waste. 

The United States spent more than 
$15 billion on Yucca Mountain before then- 
energy secretary Steven Chu pulled the 
plug, saying that the site was not a “workable 
option” — broadly interpreted to mean that 
it was doomed politically, if not technically. 
The United States has evaluated few alter- 
natives. The city of Carlsbad, which hosts 
the WIPP, is politically inclined to expand 
its nuclear-waste portfolio. But few other 
communities have shown interest in stor- 
ing nuclear waste. 

Some DOE researchers favour a serious 
exploration of borehole disposal, in part 
because no one has tested the idea, which 
dates back to the 1970s. Swift estimates that 
just 800 boreholes would take care of the 
existing US waste stockpile, as well as spent 
fuel from current reactors until about 2050. 
There is suitable rock at various depths 
across the country. “You could spread 
these things out, 


“Basically, all and you wouldn't 
oftheoldideas have to put all of 
have come your money on one 
back out of the site,” says Patrick 
woodwork” Brady, a geochem- 


ist at Sandia who is 
part of the labs borehole consortium. 

Drilling constraints might limit these 
boreholes to less than 50 centimetres in 
diameter, so spent fuel rods, currently 
stored in large canisters, would need to 
be repackaged. However, a hole that size 
would be perfect for a major source of 
waste that the DOE is trying to dispose 
of: 2,000 highly radioactive capsules 
containing caesium and strontium from 
the Hanford Site, a decommissioned 
plutonium-production facility in Wash- 
ington state. These capsules are 52-56 
centimetres long and up to 9 centimetres 
in diameter, and they contain 38% of Han- 
ford’s radioactivity. Swift says that they 
could all fit into a single borehole. 

With research worldwide concentrating 
on underground repositories, Swift says 
that it is time to try a new concept: “If we 
make a borehole, it will be the one that the 
rest of the world comes and looks at.” m 
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An abandoned train line in Namie, Japan, inside the restricted area around the Fukushima Daiichi plant. 


Japan caught up in 
energy dilemma 


As the third anniversary of the Fukushima disaster nears, 
the nation is faltering in its resolution to shun nuclear power. 


BY DAVID CYRANOSKI 


r | MVhree years after a tsunami led to reactor 
meltdowns at Japan’s Fukushima Dai- 
ichi nuclear power station, the country 

is at a crossroads in terms of energy policy. 

A draft law released last week suggests that, 

despite the previous government's promise of 

a ‘zero-nuclear’ future in the wake of the dis- 

aster, the current administration is ready to 

re-embrace the technology. Yet the announce- 

ment came just weeks before the opening of a 

¥10-billion (US$98-million) renewable-energy 

research centre in Fukushima prefecture that 
aims to be at the forefront of green technology. 

Which way will Japan turn? 

The reformist wave that swept Japan after 
the 2011 disaster included proposals to sup- 
ply all of the country’s energy from renewable 
sources. Nowhere is taking this more seriously 
than Fukushima prefecture, which plans to use 
an array of giant solar panels, biomass plants 
and windfarms to supply all of its energy by 
2040. Two floating, 7-megawatt wind turbines, 
the world’s largest, are scheduled to come 
online in the next year. 

The opening of the Fukushima Renewable 
Energy Institute in April will bolster the pre- 
fecture’s vision. The institute is in Koriyama, 
60 kilometres west of the stricken nuclear plant, 
and has been established by the Tsukuba-based 
National Institute of Advanced Industrial Sci- 
ence and Technology (AIST). It has attracted 
interest from electronics companies such as 
Panasonic and Sharp, and foreign collaborations 
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are also in the pipeline, including one with the 
Fraunhofer Institute for Solar Energy Systems in 
Freiburg, Germany. Inside, about 100 research- 
ers will work across areas including solar, hydro- 
gen, wind and geothermal power. 

Deputy director Tetsuo Munakata says that 
the institute will expand on the long-running 
research programmes of AIST, and he points to 
concrete goals such as reducing the thickness 
of silicon wafers for solar cells to 80 microme- 
tres in seven years. 

But some see the institute as a showpiece with 
little chance of success. Critics point out that Jap- 
anese solar-panel makers are already struggling 
to compete with the lower costs of Chinese man- 
ufacturers. Tetsunari lida, head of the Institute 
for Sustainable Energy Policies in Tokyo, says 
that the Fukushima institute lacks experienced 
hands and will get bogged down with ministerial 
bureaucracy. “I don’t think we can expect much 
in terms of practical results, he says. 

Sceptics also point to what seems to bea gov- 
ernment move towards nuclear energy. Despite 
continued leaks of radioactive water stored at 
Fukushima Daiichi, and extensive exclusion 
zones remaining around it (see ‘Fukushima: 
the legacy’), the draft energy plan says that the 
government will push to restart Japan's 48 oper- 
able reactors, all of which were closed after the 
earthquake that triggered the disaster. The plan 
is expected to be approved in the spring. 

But Munakata says that the new institute also 
underlines the government's commitment to 
renewable energy. “There's no way it ist going 
to keep investing in renewables,” he says. = 
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FUKUSHIMA: » 
Py CANCER RISKS , 


Radiation exposure in 


Fukushima poses only a 
negligible cancer risk. For 


example, a study last month 


The region around the Fukushima Daiichi showed that residents in the 
nuclear power plant is still subject to worst-affected residential 
exclusion zones that roughly mirror the areas areas, including Tamano, have 
of greatest contamination. This map shows only a 1% higher lifetime risk 


the perimeters and radioactivity 1 metre of developing solid cancers. 
above the ground in September 2013. Tamano ~ 


OFFSHORE RADIATION 


Y e | In December 2013, the International Atomic 
Fukushima | | Energy Agency found that the level of 
Daiichi | i caesium-137 in the ocean around the 
Fukushima plant was 1 becquerel per litre 
(Bq L?). World Health Organization guidelines 
Air radiation rates state that the maximum level of caesium-137 
(microsieverts per hour) in drinking water should be 10 Bq L1. 
ale N 20 km 
H 9.5-19 ae 
1.0-9.5 Sa 
0.2-1.0 EXCLUSION ZONES io 30 ee OCEAN POLLUTION 
Officially, 154,000 people The radiation levels farther offshore, between 
Exclusion zones have been evacuated from 2 and 20 kilometres from the port, are now 
OI No public access their homes near the plant. mostly below 0.1 Bq L? for caesium-137. 
; None have yet been allowed Even farther out, the levels are close to what 
O Restricted access to return permanently, but they were before the disaster: 0.001-0.003 
Free movement, over the next two years up to LZ Bq L" for caesium-137. 
no overnight stays 30,000 may be allowed back. Ase al aise rs 


According to the World Health 
Organization, residents living in 
the hardest-hit areas of 
Fukushima prefecture were 
subject to radiation levels of 12 
to 25 millisieverts (mSv) in the 
first year. That is equivalent to 
two or three medical CAT scans. 


SLEEPING GIANT 48 5 
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third largest producer of 537 \ A434 70 


nuclear power and had big J Pre-Fukushima 

plans to increase its output by estimate 

2030. The Fukushima disaster 280 Reactors Reactors under 
in 2011 has resulted in all of alae seria 
its reactors being turned off, eid pies 


and its expansion plans are on 
hold, although the current 
government may change that. 
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Crystalline silica dust released during construction work can cause serious lung damage. 


ENVIRONMENTAL HEALTH 


Dust regulations 
trigger backlash 


US agency’s reassessment of silica exposure rules 
provokes conflict-of-interest row with senators. 


BY DANIEL CRESSEY 


a US government agency to defend its 

actions over a proposed tightening of 
regulations concerning industrial workers’ 
exposure to deadly silica dust. 

The row blew up late last year when the 
Occupational Safety and Health Administra- 
tion (OSHA) began a public consultation on 
setting new limits for working with the dust, 
which is a major hazard for construction work- 
ers, causing serious lung disease. The agency 
ruffled feathers in the Senate when it asked that 
those submitting evidence should declare their 
funding sources. 

Last November, a group of 16 senators wrote 
an open letter to OSHA criticizing the move for 
its implication that the agency might prejudge 
submissions. The consultation period closed 
on 11 February, and OSHA is now vigorously 
defending its request. 

“What I’m doing here is essentially saying 
the information that we will base our stand- 
ard on has to be of the highest integrity, and 
we have to do it ina transparent manner, and 
conflict-of interest disclosure is an important 


S enate accusations of prejudice have forced 
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component of both of those? David Michaels, 
the head of OSHA, told Nature. “It would be 
surprising right now if a scientific journal 
didn’t ask for that information” 

Produced by tasks such as grinding concrete 
and sandblasting, used in the construction and 
other industries, crystalline silica dust can 
cause silicosis — an incurable disease involv- 
ing inflammation of the lungs — and lung 
cancer. The dust is thought to kill or disable 
thousands of people in the United States every 
year, but guidelines on working with it have not 
been updated for more than 40 years. 

“Our current standard is antiquated,” says 
Michaels. “There are literally millions of 
workers in the United States who are exposed 
to dangerous levels of silica.” 

The present rules generally advise limiting 
exposure to roughly 100 micrograms of crys- 
talline silica per cubic metre of air, averaged 
over 8 hours. OSHA has proposed halving 
this limit. Workers would also have to be bet- 
ter protected, for example by dust being ‘wet- 
ted down’ and with the use of extraction fans. 
OSHA estimates that the new regulations will 
cost about US$640 million a year, with employ- 
ers picking up most of the tab, but the agency 
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believes that the rules will save up to 700 lives a 
year. US standards are also influential in other 
countries, some note, potentially saving many 
more workers lives. 

The proposals were published in the Federal 
Register last September, at the start of the 
consultation period. In a first for OSHA, those 
wishing to submit scientific evidence as part of 
their comments were requested — although 
not required — to provide information on the 
funding sources of the research, as well as any 
funding received by the commenters that could 
potentially be considered a conflict of interest. 

The Associated General Contractors of 
America, an industry group based in Arling- 
ton, Virginia, called the proposals “significantly 
flawed” and “rife with errors and inaccurate 
data”. And shortly after they were published, 
the group of senators, led by Lamar Alexander 
(Republican, Tennessee), a senior member of 
the Senate Committee on Health, Education, 
Labor, and Pensions, wrote to OSHA saying 
that they were “very concerned about OSHASs 
attempt to have commenters disclose their 
financial backers”. They added that the request 
“raises questions” about whether OSHA would 
prejudge submissions on the basis of who was 
sending them. 

“The chilling effect the financial disclosure 
could have seems counter to the idea of robust 
inclusion of a diverse set of ideas and views to 
inform the rule-making,’ Liz Wolgemuth, a 
spokeswoman for Alexander, told Nature. 

But pharmacologist Lisa Bero of the Uni- 
versity of California, San Francisco, says that 
her own research on similar rule-making pro- 
cesses for tobacco control found that scientists 
opposing rules were often funded by indus- 
try groups. She supports the new disclosure 
request. “The regulatory agencies have to be 
in a position to critically appraise the studies 
that come to them,” she says. 

There is also support for the new silica 
standard. Tee Guidotti, a physician in Wash- 
ington DC and a member of the American 
Thoracic Society’s Environmental Health 
Policy Committee, says that the scientific case 
for the proposed limit is “close to being bullet- 
proof”. He adds that, if it is successful, it could 
provide a template for how OSHA deals with 
similar hazards, such as dust and radon. 

But Susan Dudley, director of George 
Washington University’s Regulatory Studies 
Center in Washington DC, which conducts 
independent research on regulatory propos- 
als, argues that there has already been a drop 
in exposure to silica dust and its health effects 
in recent years. She supports a lower exposure 
limit, but believes evidence is weaker for some 
of the specific requirements proposed to reach 
it, such as dust wetting. 

The viewpoints contained in the 1,600 or so 
comments received through the consultation 
will be discussed in public hearings starting 
on 18 March. It will probably be several years 
before a final rule is enacted. m SEE EDITORIALP.8 
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Prenatal-screening companies 
expand scope of DNA tests 


But the increasingly accurate analyses carry the ethical dilemma of uncertain outcomes. 


BY ERIKA CHECK HAYDEN 


hen the small piece of chromo- 
some 22 associated with DiGeorge 
syndrome is missing, this can 


cause developmental abnormalities such as 
speech and language delays, cleft palate, a 
faulty immune system and heart defects. Or an 
apparently healthy baby can be born. 

With non-invasive genetic screening on the 
rise, companies are racing to add chromosomal 
abnormalities such as that associated with 
DiGeorge syndrome to the list of disorders for 
which they can test. But because the effects of 
these genetic anomalies are sometimes uncer- 
tain, the test results may prompt difficult deci- 
sions for families who receive such diagnoses. 

The tests are a welcome development, says 
Philippa Brice, communications director of 
the non-profit PHG Foundation in Cambridge, 
UK, which promotes the responsible use of bio- 
medical science. But she adds, “there is a risk 
that women may receive early in pregnancy a 
result they do not fully understand”. 

The issue is becoming more common as 
genetic-screening companies demonstrate the 
accuracy of the tests — and press their case that 
all pregnant women should be screened. On 
26 February, scientists at Iumina, a genome- 
sequencing company headquartered in San 
Diego, California, announced the results 
of a study comparing its genetic test, called 
verifi, with the standard biochemical screen- 
ing routinely offered to pregnant women 
(D. W. Bianchi et al. N. Engl. J. Med. http://doi. 
org/rqn; 2014). The genetic test checks directly 
for abnormalities in fetal DNA that crosses the 
placenta and enters the mother’s bloodstream, 
whereas the biochemical screen estimates the 
risk of chromosomal disorders indirectly, by 
measuring proteins and hormones in the moth- 
er’s blood that are linked with higher rates of 
birth defects. If either screen suggests the pres- 
ence of a defect, it is usually confirmed by an 
invasive test, such as an amniocentesis. 

The study, which was funded by Illumina, 
screened more than 1,900 US women for 
chromosomal abnormalities, including Down's 
syndrome, using both the biochemical and 
genetic methods. The verifi test had a false-pos- 
itive rate of 0.3%, about one-tenth of that associ- 
ated with biochemical screening. The finding is 
crucial because most physicians groups recom- 
mend that screening be used only in women at 


DNA deletions in chromosome 22 (indicated by pink markers) are associated with DiGeorge syndrome. 


high risk of carrying a fetus with chromosomal 
disorders — mainly women aged over 35 — to 
minimize the chance that a false positive would 
lead to an invasive test. The study examined 
women with an average age of just under 30. 
With such a low false-positive rate, the 
genetic screening should be offered to women 
of all ages, says Anthony Gregg, a geneticist 
and obstetrician at the University of Florida in 
Gainesville who was the lead author of genetic- 
screening guidelines published last April by the 
American College of Medical Genetics and 
Genomics in Bethesda, Maryland. 
Increased acceptance could expand the 
already growing market for genetic screen- 
ing. Sequenom, a 


“The companies sequencing company 
are all trying based in San Diego, 
to one-up each said in January that it 
other.” performed 148,500 of 


its MaterniT21 tests in 
2013, up 140% from 2012. And in November, 
Ariosa of San Jose, California, said that it had 
performed its Harmony test on 150,000 women 
since launching in May 2012. Most tests are per- 
formed in the United States, although the com- 
panies are preparing to roll them out globally. 
Companies are trying to stand out by 
expanding the number of conditions their 
tests check for. Microdeletions and micro- 
duplications — genetic defects that can be 
millions of base pairs long — are seen as 
the first step, although some companies 
have designs to sequence the entire fetal 
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genome. “The companies are all trying to 
one-up each other,” says Wendy Chung, 
a paediatrician and geneticist at Columbia 
University in New York City. “It’s not clear 
where the number will stop? 

Typically, genetic and biochemical screen- 
ing tests have scanned only for missing or 
extra copies of whole chromosomes, which are 
known to result in clear outcomes. An extra 
copy of chromosome 21, for instance, causes 
Downs syndrome. But last October, Seque- 
nom began looking for microdeletions within 
chromosomes with its MaterniT21PLUS test. 
Natera, based in San Carlos, California, added 
tests for microdeletions to its Panorama test last 
month. And Illumina says that it will follow suit 
and expand the capabilities of its verifi test in 
the next few months. “Nobody in the twenty- 
first century should have a pregnancy without 
being screened for these microdeletions,” says 
Matthew Rabinowitz, Natera’s chief executive. 

Routine checking for microdeletions could 
enable more families to prepare for children 
with special needs. But because such micro- 
deletions are smaller than whole-chromosome 
abnormalities, they affect fewer genes, and the 
physical or mental effects are not always predict- 
able. Chung, for instance, knows of many adults 
who found out that they had DiGeorge syn- 
drome only after their children were diagnosed 
and they decided to get tested themselves. She 
says that as tests expand their scope and offer 
less-certain predictions, more families will face 
tough choices about their unborn children. = 
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Rare fossils, such as this pterosaur from Brazil’s Araripe Basin, are regularly seized by authorities. 


PALAEONTOLOGY 


Brazil clamps down 
on illegal fossil trade 


Debate rages as collectors accused of exporting specimens 


face up to 20 years injail. 


BY ELIZABETH GIBNEY 


hirteen people are scheduled to go on 

| trial in Brazil for smuggling fossils out 

of the country, apparently to private col- 

lectors and to museums in Germany and the 
United Kingdom. 

The arrests, which followed the seizure of 
around 1,000 fossils by French authorities in 
October, are part of a growing effort by Brazil- 
ian authorities to crack down on illegal fossil 
trading. Although the police have not released 
the names of the accused, authorities say that 
one, a German national, works for palaeonto- 
logical museums in the United States, Europe 
and China. If convicted, some of the individu- 
als face a maximum of 20 years in jail. 

The case, which is still awaiting a trial date, 
has reopened the debate about the fossil trade, 
and whether science in developing countries 
benefits from protectionist policies. 

In Brazil, it is illegal for anyone not affili- 
ated with a national or state research institu- 
tion to dig for fossils without authorization 
from the Brazilian National Department of 
Mineral Production (DNPM). The law dates 
back to 1942, but official permits are rarely 
given out. Brazil’s constitution also says that 
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fossils found in the country are the property 
of the state, making their sale or export with- 
out permission illegal. “Despite this, people 
are taking fossils out of the country in tonnes 
per month,” says Max Langer, president of the 
Brazilian Society of Palaeontology. 

The fossil-rich Araripe Basin in northeastern 
Brazil has long been a source of black-market 
specimens. Langer says that he is not aware of 
the DNPM ever granting a licence to foreign 
collectors there, meaning that most Araripe 
fossils now abroad were obtained illegally. 

Many Brazilian palaeontologists broadly 
support the law. They argue that keeping fossils 
in the country preserves cultural heritage and 
helps to develop Brazil’s palaeontological exper- 
tise. Illegal collecting also damages specimens 
and prevents the acquisition of contextual field 
data, some say. 

But the crackdown is worrying some pal- 
aeontologists outside Brazil, who say that the 
laws hamper science. “Scientists who just want 
to go about doing science are frustrated, says 
David Martill, a palaeontologist at the Univer- 
sity of Portsmouth, UK. 

The enforcement has been driven by Brazil’s 
growing economy and strength in palaeontol- 
ogy, as well as an increasing public awareness 
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of fossil sales and loss of cultural heritage, says 
Alexander Kellner, a palaeontologist at the 
National Museum of Brazil, part of the Federal 
University of Rio de Janeiro. Authorities, for 
example, were quick to act last month when 
the online auction site eBay listed a Brazilian 
pterosaur for sale at US$262,000. The auction, 
by French company Geofossiles, closed without 
sale, and Brazilian and international police are 
reportedly investigating the case. Brazil could 
ask France to repatriate the fossil if authorities 
can prove that it was found in the country. 

Researchers say that the loss of fossils has 
seriously harmed Brazilian science. For 
instance, in findings presented at the 7th Bra- 
zilian Symposium of Vertebrate Palaeontology 
in 2010, Marcos Sales, a palaeontologist at the 
Federal University of Rio Grande do Sul in 
Porto Alegre, found that 14 of the 24 reference 
specimens, or holotypes, of pterosaur fossils 
collected in the Araripe Basin were housed by 
foreign institutions. 

The issue is not unique to Brazil. Other 
countries that list fossils as national prop- 
erty and prohibit their export, such as China, 
Mongolia and Morocco, are increasingly 
enforcing their own laws. In 2012, for example, 
Mongolia successfully fought for the return of 
a 7.3-metre-long tarbosaur fossil, which had 
been illegally exported to the United States and 
sold there for more than $1 million. 

But critics such as Martill argue that crimi- 
nalizing the fossil trade merely sends it under- 
ground. Crackdowns cut the supply line 
because informal diggers, such as quarry work- 
ers, fear arrest. Knowing “dodgy” people is the 
only way to get samples, says Martill, because 
the DNPM ignores requests to dig. He adds 
that palaeontology needs commercial trade 
because without it there is little excavation. 

David Unwin, a palaeobiologist at the Uni- 
versity of Leicester, UK, says that certain fossils 
are so abundant that there is no harm done if 
some are taken abroad. He also notes that the 
cost of preparing and displaying fossils is very 
high. “Ifa country is relatively poor and can- 
not afford those kinds of things, it is perhaps 
better that those fossils go somewhere else,” he 
says. Langer calls this a “very first-world view”. 

Kellner admits that the Brazilian law is not 
perfect. He was a victim of it in 2012, when he 
was arrested while transporting fossils within 
Brazil, although the charges were dropped. He 
still supports the law, but says that the govern- 
ment’s crackdown must be accompanied by 
a drive to give Brazilian palaeontologists the 
funding to collect fossils. Others say that the 
DNPM lacks the resources to enforce the law. 

Langer feels that fossils must be kept in the 
country to help to improve Brazilian science. 
And he expects fellow researchers to hold 
Brazil's laws in higher regard than the private 
collectors who also fuel the trade. “Foreign 
researchers know it’s illegal. They could work 
here in Brazil with Brazilian partners, but they 
choose not to,” he says. m 
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Budget woes hinder 
clinical-trials network 


Costly cancer trials a challenge for revamped US programme. 


BY HEIDI LEDFORD 


US cancer clinical-trials programme 

came to an end on | March. But many 
fear that the programme replacing it — 
although designed to be nimbler and more 
focused — will be unable to conduct the types 
of trial needed in an era of expensive, personal- 
ized cancer treatments. 

“We've got an exciting new structure and 
great science,’ says Monica Bertagnolli, an 
oncologist at the Dana-Farber/Brigham and 
Women’s Cancer Center in Boston, Massachu- 
setts, who serves on the new National Clinical 
Trials Network (NCTN). “But the severe budg- 
etary constraints are forcing us to really limit 
what studies are being done.” 

For more than half a century, the Clini- 
cal Trials Cooperative Group Program, run 
by the US National Cancer Institute (NCI) 
in Bethesda, Maryland, has been the main 
government-funded mechanism for carrying 
out trials that drug companies avoid because 
the results are unlikely to boost profits. The 
programme has had notable successes: it 
helped to boost the cure rate of childhood 
cancers from less than 10% to almost 80%; 
determined that the drug tamoxifen could 
halve the incidence of breast cancer in women 
at high risk of the disease; and found effective 
alternatives to radical mastectomies for early- 
stage breast cancers. 

But the cooperative group programme 
has also been criticized for its inefficiency 
at a time when cancer research is advanc- 
ing rapidly. Tumours have been divided into 
subtypes on the basis of their genetic muta- 
tions, and there is growing interest in testing 
tailor-made therapies. To achieve that, many 
patients must be screened to find those with 
the right cancer types. 

In theory, the cooperative group pro- 
gramme, which included more than 3,000 
institutions and some 14,000 investigators, 
could have provided the power and reach 
needed to find those patients. But a 2010 report 
by the US Institute of Medicine pronounced 
it unfit for the challenge. The institute found 
that many trials took so long to launch that, 
by the time they did, they no longer reflected 
cutting-edge science. Only about 60% of the 
programme’s trials were ever completed. 

In response, over the past four years, the 
NCI has been readying a replacement: the 


| i ollowing years of criticism, a nationwide 


NCTIN. Ten cooperative groups have been 
consolidated to five, and there is now a uni- 
fied system for data management and ethical 
review. The NCTN also has strict deadlines for 
trial initiation. “Ifa study doesn’t meet those 
timelines, it’s dead,” says Bertagnolli. The time 
taken to start a large clinical trial has already 
dropped from two years to one — in line with 
the time it takes pharmaceutical companies 
to launch trials, says James Doroshow, deputy 
NCI director for clinical and translational 
research. 

But the network's budget, just like that of the 
old programme, falls far short of those of its 
industry counterparts. The NCI reimburses 
member institutions by about US$2,000 
per enrolled patient — much less than the 


$20,000 per patient 
“What if a that pharmaceutical 
critical scientific | companies provide, 
opportunity says Philip DiSaia, 
arises that an oncologist at 
cannot be the University of 
addressed?” California, Irvine, 


who was chair of a 
gynaecological oncology cooperative group. 
Asa result, some institutions have threatened 
to pull out of the network, says DiSaia, and oth- 
ers have capped the number of patients they 
will enrol in an NCTN trial. 

At some clinical-trial sites, the NCI plans 
to boost reimbursement rates to $4,000 per 
patient. But that, combined with a flat budget, 
means that the NCTN will admit only 17,000 
patients in 2014, down from 21,000 admitted 
last year under the old system. And that could 
limit the kinds of trials the network can take 
on, says Robert Comis, co-chair of an NCTN 
group and an oncologist at Drexel University 
in Philadelphia, Pennsylvania. “What ifa criti- 
cal scientific opportunity arises that cannot be 
addressed because of the quotas?” = 


CORRECTION 

In the News Feature ‘Smart enough to die?’ 
(Nature 506, 284-286; 2014), it was the 
Virginia court system, not the US Supreme 
Court, that changed Daryl Atkins’s sentence 
to life imprisonment. The Supreme Court 
also did not mandate that rulings on 
intellectual disability in death-penalty cases 
should follow the standards of the American 
Association on Mental Retardation. 
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THE S/NS OF THE 


FATHER 


The roots of inheritance may extend beyond the genome, 
but the mechanisms remain a puzzle. 


hen Brian Dias became 

a father last October, he 

was, like any new parent, 
W mindful of the enormous 

responsibility that lay 

before him. From that 

moment on, every choice 
he made could affect his newborn son’s physi- 
cal and psychological development. But, unlike 
most new parents, Dias was also aware of the 
influence of his past experiences — not to 
mention those of his parents, his grandparents 
and beyond. 

Where one’s ancestors lived, or how much 
they valued education, can clearly have effects 
that pass down through the generations. But 
what about the legacy of their health: whether 
they smoked, endured famine or fought in 
a war? 

As a postdoc in Kerry Ressler’s laboratory 
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at Emory University in Atlanta, Georgia, Dias 
had spent much of the two years before his 
son's birth studying these kinds of questions 
in mice. Specifically, he looked at how fear 
associated with a particular smell affects the 
animals and leaves an imprint on the brains of 
their descendants. 

Dias had been exposing male mice to aceto- 
phenone — a chemical with a sweet, almond- 
like smell — and then giving them a mild foot 
shock. After being exposed to this treatment 
five times a day for three days, the mice became 
reliably fearful, freezing in the presence of 
acetophenone even when they received no 
shock. 

Ten days later, Dias allowed the mice to 
mate with unexposed females. When their 
young grew up, many of the animals were more 
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sensitive to acetophenone than to other odours, 
and more likely to be startled by an unexpected 
noise during exposure to the smell. Their 
offspring — the ‘grandchildrer’ of the mice 
trained to fear the smell — were also jumpier 
in the presence of acetophenone. What’s more, 
all three generations had larger-than-normal 
‘M71 glomeruli; structures where acetophe- 
none-sensitive neurons in the nose connect 
with neurons in the olfactory bulb. In the 
January issue of Nature Neuroscience’, Dias and 
Ressler suggested that this hereditary trans- 
mission of environmental information was the 
result of epigenetics — chemical changes to the 
genome that affect how DNA is packaged and 
expressed without altering its sequence. 
Biologists first observed this ‘transgen- 
erational epigenetic inheritance’ in plants. 
Tomatoes, for example, pass along chemical 
markings that control an important ripening 


20TH CENTURY FOX/THE KOBAL COLLECTION 


gene’. But, over the past few years, evidence 
has been accumulating that the phenomenon 
occurs in rodents and humans as well. The 
subject remains controversial, in part because 
it harks back to the discredited theories of 
Jean-Baptiste Lamarck, a nineteenth-century 
French biologist who proposed that organ- 
isms pass down acquired traits to future 
generations. To many modern biologists, 
that’s “scary-sounding”, says Oliver Rando, 
a molecular biologist at the University of 
Massachusetts Medical School in Worcester, 
whose work suggests that such inheritance 
does indeed happen in animals’. If it is true, 
he says, “Why hasn't this been obvious to all 
the brilliant researchers in the past hundred 
years of genetics?”. 

One reason why many remain sceptical is 
that the mechanism by which such inherit- 
ance might work is mysterious. Explaining it 
will require a deep dive into reproductive biol- 
ogy to demonstrate how the relevant signals 
might be formed in the germ line, the cells that 
develop into sperm and eggs and carry on, ata 
minimum, a person's genetic legacy. 

A mother might pass on effects of environ- 
mental exposures to a fetus during pregnancy. 
So, to study the phenomenon of transgenera- 
tional epigenetics cleanly, biologists are focus- 
ing on fathers, and have been looking at how 
sperm might gain and lose epigenetic marks. 
“In the past two to three years there’s been a 
lot of new information,’ says Michelle Lane, 
a reproductive biologist at the University of 
Adelaide in Australia. But proposals for how 
it all works are themselves embryonic. “It’s a 
huge black box,” Lane says. 


MONSTER PLANTS AND OBESE CHILDREN 

The epigenetics revolution hit in the early 
2000s, when scientists began reporting that 
environmental factors — everything from 
neglectful mothering and child abuse to a 
high-fat diet and air pollution — can influ- 
ence the addition or removal of chemical tags 
on DNA that turn genes on and off. This idea 
ofan environmentally responsive genome still 
stirs debate (see Nature 467, 146-148; 2010). 
But the notion that epigenetic marks are 
transmitted across generations is even more 
provocative. 

Swedish botanist Carl Linnaeus was among 
the first to spot changes resulting from this 
phenomenon. In the 1740s, he received a 
plant specimen that looked very similar to 
common toadflax (Linaria vulgaris), but 
with very different flowers. Linnaeus was 
shocked because this challenged his theory 
that plant species could be categorized by the 
structure of their flowers. “This is certainly 
no less remarkable,’ he wrote, “than if a cow 
were to give birth to a calf with a wolf’s head.” 
He named the plant Peloria, after the Greek 
word for ‘monster. 

In the 1990s, plant biologist Enrico Coen at 
the John Innes Centre in Norwich, UK, found 


that in the monster plants, methyl groups litter 
a gene involved in flower structure called Lcyc, 
completely shutting it down. (DNA methyl- 
ation usually turns genes off.) Coen’s team also 
showed that these methyl marks pass through 
seeds to later generations’. 

The public first started to take notice in the 


“WHY HASN'T THIS 
BEEN OBVIOUS TO 
ALL THE BRILLIANT 
RESEARCHERS IN 
THE PAST HUNDRED 
YEARS OF GENETICS?” 


mid-2000s, after large epidemiological investi- 
gations in Europe began to show transgenera- 
tional effects in humans. One study of Swedish 
historical records showed that men who had 
experienced famine before puberty were less 
likely to have grandsons with heart disease 
or diabetes than men who had plenty to eat’. 
Similar work with children in Britain reported 
in 2005 that fathers who had started smoking 
before the age of 11 had an increased risk of 
having boys of above average weight°. 

But many scientists remained sceptical. 
Epidemiological studies are often messy, and 
it is impossible to rule out all confounding 
variables. In the past few years, however, sev- 
eral studies in rodents have supported these 
observations and begun to attribute the trans- 
mission of various traits to changes in sperm. 


SPERM SIGNATURES 

Male rats fed a high-fat diet, for example, beget 
daughters with abnormal DNA methylation in 
the pancreas’. Male mice fed a low-protein diet 
have offspring with altered liver expression of 
cholesterol genes*. And male mice with pre- 
diabetes have abnormal sperm methylation, 
and pass on an increased risk of diabetes to 
the next two generations*. 

“We and many other people have now 
shown these paternal effects,’ says Rando, who 
led the low-protein study. “And we're all having 
a hell of a time figuring out how they work.” 

The animal studies have triggered some 
strong debate. The most controversial results 
have come out of Michael Skinner’s lab at 
Washington State University in Pullman. 
Skinner's team exposed pregnant rats to large 
doses of pesticides and fungicides, which led 
to organ damage in their adult offspring. The 
sperm of male offspring showed changes in 
DNA methylation that persisted for at least 
four generations’. 

But at least two groups failed to replicate the 
data, and in 2010, federal investigators found 
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that one of Skinner’s postdocs had fabricated 
data for a related paper, which the authors 
had retracted in 2009. Skinner says that some 
teams have replicated his results, and that those 
who have not were using inappropriate proto- 
cols. Last year, his own team reported success- 
fully reproducing the results of the retracted 
paper’. 


METHYLATION MECHANISM 

Explaining how transgenerational epigenetics 
works has been difficult in part because most 
studies track outcomes — such as changes in 
glucose, cholesterol and fertility — that can be 
affected by a range of factors, making it tricky to 
tease out cause and effect. By contrast, Dias and 
Ressler’s work with acetophenone takes advan- 
tage of specific biology: the chemical binds toa 
particular receptor in the nose that is encoded 
bya single gene, dubbed Olfr151. “This is the 
massive pro of their study,’ Rando says. 

Dias and Ressler do not claim to understand 
exactly what is going on, but they do have a 
working hypothesis. Somehow, the infor- 
mation about the frightening smell gets into 
a mouse’s testes and results in lower meth- 
ylation of the O/fr151 gene in sperm DNA. 
The researchers even ran experiments using 
in vitro fertilization to make sure that the 
father was not in some way passing on a fear 
of acetophenone through interactions with the 
mother. The epigenetic tweak in the sperm is 
perpetuated in the offspring’s DNA, leading 
to increased expression of the receptor in the 
animals’ noses and, ultimately, enhanced sen- 
sitivity to the smell. 

But the chain of causation is loose. “There 
are a lot of disconnects there,’ says William 
Kelly, a developmental geneticist at Emory. “It’s 
not beyond the realm of possibility or plausi- 
bility. It’s just right now we don’t know enough 
about how information is transferred between 
generations.” 

The first question is how the effects of envi- 
ronmental exposure become embedded in an 
animal's germ cells — in this case, the mouse’s 
sperm. Germ cells have been shown to express 
olfactory receptors"'. So it is possible that 
Olfr151 receptors in sperm respond to odorant 
molecules in the bloodstream and then change 
the methylation of the corresponding gene in 
sperm DNA. 

Alternatively, after being exposed to the 
odour and the pain, a mouse might produce 
RNA molecules — perhaps in the brain — that 
make their way into the bloodstream and then 
selectively target the Olfr151 gene in sperm. 
Many studies in plants have hinted at this sort 
of systemic RNA shuttling. RNA molecules 
expressed in a plant’s leaf, for example, can 
travel through its vascular system to many of 
its other tissues and affect gene expression”. 

But creating an epigenetic mark in the 
sperm is only the first step. To pass down 
through multiple generations, the signal 
needs to survive multiple rounds of rigorous 
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epigenetic reprogramming. In mammals, the 
first of these happens just hours after concep- 
tion, when most methylation is stripped from 
sperm DNA in the single-celled embryo. Then, 
as the embryo develops and divides, and cells 
begin to differentiate into various tissue types, 
methylation is gradually re-established. But 
even if some signal from the father were to 
survive this process, the embryo’s own primor- 
dial germ cells, those that eventually become 
its sperm or eggs, undergo a second round of 
epigenetic scrubbing (See ‘Without a trace’). 

Some genes manage to escape these periods 
of major reprogramming. The best example is 
genes that are imprinted — whereby one copy 
from the mother or father is robustly methyl- 
ated and effectively silenced. These silencing 
marks crop up in the egg or sperm and are 
retained in the embryo. 

About 100 genes are known to be imprinted, 
but some non-imprinted genes may also 
escape the scrubbing through a similar mecha- 
nism. “There is a growing consensus that there 
are more regions than previously thought that 
escape reprogramming in sperm,’ says Sarah 
Kimmins, an epigeneticist at McGill University 
in Montreal, Canada. “Why this is, and how, is 
not yet known, although studying imprinted 
genes may reveal clues.” 

Then again, even if Olfr151 does escape 
reprogramming, it is hard to explain how that 
could lead to a noticeable difference in the 
behaviour of fully formed offspring. Dias and 
Ressler reported that in sperm samples from 
mice trained to fear acetophenone, about 86 
out of every 100 sperm show O/fr151 methyl- 
ation, whereas in mice trained to fear a differ- 
ent odour it is about 95 out of every 100. This 
difference is statistically significant, but fairly 
small. And yet the behavioural effects in the 
second generation were robust: about half of 
the acetophenone-trained animals’ offspring 
showed increased sensitivity to the odour. 


“SOMETHING GOOFBALL’? 

Although many are scratching their heads over 
the holes in the proposed mechanism, few are 
suggesting that the underlying phenomenon 
is a fairy tale. “Impossible things are happen- 
ing every day,’ says Kelly, quoting a line from 
Rodgers and Hammerstein's Cinderella. 

It is possible, for example, that the DNA- 
methylation tweaks reported in the odour 
study are simply a by-product of an altogether 
different mechanism. 

One route might be chemical marks on 
histones, the proteins around which DNA 
wraps. Acetyl and methyl groups can attach 
to histones and affect the expression of nearby 
DNA. But during sperm-cell formation, DNA 
is stripped of most of its histones (and their 
attendant marks) and wraps instead around 
protamines, which pack it more tightly. 

Nevertheless, about 10% of human histones 
—and about 1% of mouse ones — are retained. 
These sites might carry information from one 
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WITHOUT A TRACE 


Researchers are struggling to understand how epigenetic 
marks, such as DNA methylation, could pass from one 
generation to the next in mammals. During development, 


the cells destined to become sperm undergo multiple 
rounds of epigenetic scrubbing and reprogramming. 


DNA repackaged 
with different 


DNA 
methylation 
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DNA 
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generation to the next. In 2011, researchers 
reported that, in nematode worms, certain 
histone marks correlate with long life and 
can be passed down through several genera- 
tions!’. And last December, Kimmins and her 
colleagues showed that feeding male mice a 
diet low in folate — a nutrient that provides 
the raw materials for methylation — led to 
significantly reduced methylation of histone 
proteins in the animals’ sperm and more birth 
defects in their offspring”. 

Still other studies point to a mechanism 
involving short RNA molecules latching on to 
DNA and affecting gene expression. Twenty- 
eight microRNAs are expressed differently in 
the sperm of men who do and do not smoke, 
according toa study reported in 2012 (ref. 15). 
And these RNA patterns may persist through 
multiple generations. Last year, Lane’s group 
found that obese male mice show abnormal 
expression of 11 microRNAs in their sperm — 
and that they pass on insulin resistance to the 
next two generations”®. 

Then there is the possibility that the mecha- 
nism is, as Rando puts it, “something goofball”. 
That might be prions — misfolded proteins 
that act as infectious agents — which have been 
shown to transmit heritable traits in budding 
yeast (see Nature 482, 294-296; 2012). Or it 
could be something in semen besides sperm. 
Researchers reported in January” that mice 
born of fathers lacking seminal vesicles are 
fatter and have more metabolic problems than 
controls, suggesting that molecules in seminal 
fluid influence gene expression in sperm and 
the female reproductive tract. 

If the mechanism involves DNA 
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methylation, histones or RNA, the field is 
likely to make great progress in the next few 
years, Rando predicts. “But if it’s something 
completely novel,’ he says, “Maybe it will take 
decades to figure out” 

Dias has his fingers crossed for the former. He 
is going to Boston, Massachusetts, in April fora 
Keystone meeting on epigenetic inheritance, to 
get a sense of the most promising mechanistic 
avenues to follow. “If science has taught me any- 
thing,” he says, “it is to not discount the myriad 
ways of becoming and being.” = 


Virginia Hughes is a freelance journalist in 
New York City. 
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ABETTER BATTERY 


Chemists are reinventing rechargeable cells 
to drive down costs and boost capacity. 


he mobile world depends on lithium- 
ion batteries — today’s ultimate 
rechargeable energy store. Last year, 
consumers bought five billion Li-ion 
cells to supply power-hungry laptops, 
cameras, mobile phones and electric cars. “It 
is the best battery technology anyone has ever 
seen, says George Crabtree, director of the 
US Joint Center for Energy Storage Research 
(JCESR), which is based at the Argonne 
National Laboratory near Chicago, Illinois. 
But Crabtree wants to do much, much better. 

Modern Li-ion batteries hold more than 
twice as much energy by weight as the first 
commercial versions sold by Sony in 1991 — 
and are ten times cheaper. But they are near- 
ing their limit. Most researchers think that 
improvements to Li-ion cells can squeeze in at 
most 30% more energy by weight (see ‘Power- 
ing up’). That means that Li-ion cells will never 
give electric cars the 800-kilometre range of 
a petrol tank, or supply power-hungry smart- 
phones with many days of juice. 

In 2012, the JCESR hub won US$120 mil- 
lion from the US Department of Energy to 
take a leap beyond Li-ion technology. Its stated 
goal was to make cells that, when scaled up to 
the sort of commercial battery packs used in 
electric cars, would be five times more energy 
dense than the standard of the day, and five 
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times cheaper, in just five years. That means 
hitting a target of 400 watt-hours per kilogram 
(Wh kg”) by 2017. 

Crabtree calls the goal “very aggressive”; 
veteran battery researcher Jeff Dahn at Dal- 
housie University in Halifax, Canada, calls it 
“impossible”. The energy density of recharge- 
able batteries has risen only sixfold since the 
early lead—nickel rechargeables of the 1900s. 
But, says Dahn, the JCESR’s target focuses 
attention on technologies that will be crucial in 
helping the world to switch to renewable energy 
sources — storing up solar energy for night- 
time or a rainy day, for example. And the US 
hub is far from alone. Many research teams and 
companies in Asia, the Americas and Europe 
are looking beyond Li-ion, and are pursuing 
strategies that may topple it from its throne. 


(i; LOSE THE DEAD WEIGHT 

Chemical engineer Elton Cairns suspected 
he had tamed a promising-but-wild battery 
chemistry early last year, when his coin-sized 
cells were still going strong even after a few 
months of continual draining and recharg- 
ing. By July, his cells at the Lawrence Berkeley 
National Laboratory in Berkeley, California, 
had cycled 1,500 times and had lost only half 
of their capacity’ — a performance roughly on 
a par with the best Li-ion batteries. 


© 2014 Macmillan Publishers Limited. All rights reserved 


His batteries are based on lithium-sulphur 
(Li-S) technology, which uses extremely cheap 
materials and in theory can pack in five times 
more energy by weight than Li-ion (in prac- 
tice, researchers suspect, it will probably be 
only twice as much). Li-S batteries were first 
posited 40 years ago, but researchers could not 
get them to survive past about 100 cycles. Now, 
many think that the devices are the technology 
closest to becoming a commercially viable suc- 
cessor to Li-ion. 

One of Li-S’s main advantages, says Cairns, 
is that it gets rid of the “dead weight” in a 
Li-ion battery. Inside a typical Li-ion cell, space 
is taken up bya layered graphite electrode that 
does little more than host lithium ions. These 
ions flow through a charge-carrying liquid 
electrolyte into a layered metal oxide elec- 
trode. As with all batteries, current is generated 
because electrons must flow around an out- 
side circuit to balance the charges (see ‘Radical 
redesigns’). To recharge the battery, a voltage is 
applied to reverse the electron flow, which also 
drives the lithium ions back. 

Ina Li-S battery, the graphite is replaced by 
a sliver of pure lithium metal that does dou- 
ble duty as both the electrode and the supplier 
of lithium ions: it shrinks as the battery runs, 
and reforms when the battery is recharged. 
And the metal oxide is replaced by cheaper, 
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lighter sulphur that can really pack the lithium 
in: each sulphur atom bonds to two lithium 
atoms, whereas it takes more than one metal 
atom to bond to just one lithium. All of that 
creates a distinct weight and cost advantage for 
Li-S technology. 

But the reaction between lithium and 
sulphur causes a problem. As the battery is 
charged and discharged, soluble Li-S com- 
pounds can seep into the electrolyte, degrad- 
ing the electrodes so that the battery loses 
charge and the cell gums up. To prevent this, 
Cairns uses tricks made possible by advances 
in nanotechnology and electrolyte chemistry 
— including adulterating his sulphur electrode 
with graphene oxide binders, and using spe- 
cially designed electrolytes that do not dissolve 
lithium and sulphur so much. Cairns predicts 
that a commercial-sized cell could achieve an 
energy-density of around 500 Wh kg’. Other 
labs are reporting similar results, he says. 

Some researchers doubt that the academic 
cheer will translate into commercial suc- 
cess. Laboratories often use low proportions 
of sulphur and lots of electrolyte, which is 
relatively easy to work with but does not cre- 
ate an energy-dense battery. Bumping up the 
sulphur and decreasing the electrolyte makes 
the cell more likely to gum up, says Steve Visco, 
who has spent more than 20 years working on 
Li-S at battery firm PolyPlus in Berkeley, just 
5 kilometres west of Cairns’ lab. Making a 
cheap commercial cell that works over a range 
of temperatures will also be hard, he says. 

At least one company stands by Li-S’s pros- 
pects: Oxis Energy in Abingdon, UK. It says it 
has run large cells for an impressive 900 cycles, 
at energy densities that match current Li-ion 
cells. Oxis is working with Lotus Engineering, 
headquartered in Ann Arbor, Michigan, on 
a project to reach 400 Whkg" by 2016 for an 
electric vehicle. 


(i PACK MORE PUNCH PER ION 

As the world’s lightest metal, lithium provides 
a huge weight advantage. But some researchers 
argue that the next generation of cells should 
switch to heavier elements such as magnesium. 
Unlike lithium ions, which can carry only one 
electrical charge each, doubly charged mag- 
nesium ions shuttle two at a time — instantly 
multiplying the electrical energy that can be 
released for the same volume. 

Magnesium comes with its own challenge, 
however: whereas lithium zips through elec- 
trolytes and electrodes, magnesium with its 
two charges moves as if through treacle. 

Peter Chupas, a battery researcher at 
Argonne National Laboratory who is working 
with the JCESR, is shooting high-energy X-rays 
at magnesium in various electrolytes to investi- 
gate why it experiences so much drag. So far, he 
and his colleagues have found that magnesium 
exerts a strong pull on oxygen atoms in any sur- 
rounding solvent, attracting clusters of solvent 
molecules that make it bulkier. That kind of 
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basic research is key to creating a better bat- 
tery, but it is not usually done by industry, says 
Crabtree. “The typical R&D operation operates 
on trial and error, not fundamental research,” 
he says. This, he says, is where JCESR is bring- 
ing an advantage to the field. 

Materials scientist Kristin Persson at Law- 
rence Berkeley is using a supercomputer to 
simulate the innards of possible new batter- 
ies, trying to find a combination of electrodes 
and electrolytes that will allow magnesium to 
pass through more easily. “Right now, we are 
crunching through around 2,000 different elec- 
trolytes,” she says. 

Persson and Gerbrand Ceder, a materials 


Five times more energy 
dense, and five times 
cheaper, in just five years: 
an “impossible” goal? 


scientist at the Massachusetts Institute of Tech- 
nology in Cambridge, founded a company to 
develop these higher-charge-carrying batter- 
ies. Pellion Technologies, based in Cambridge, 
is tight-lipped about its results; it has published 
only one paper about electrolytes”. A spate of 
patents published in late 2013 hint that the 
company is developing more-open electrode 
structures to help the magnesium ions to flow. 
Major electronics firms such as Toyota, LG, 
Samsung and Hitachi are also working on 
such cells, releasing little information beyond 
occasional teasers. 
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As companies jostle in secret, Persson con- 
tinues to run through what she calls the “elec- 
trolyte genome’. The sifting-by-supercomputer 
approach could also help the search for batteries 
made with other multiple-charge-carrying (or 
‘multivalent’) metals, such as aluminium and 
calcium. Ceder urges patience, pointing out 
that research into Li-ion battery chemistry has 
enjoyed a 40-year head start. “We have so little 
information about multivalent ions,’ he says. 


(i MAKE BATTERIES THAT BREATHE 

Winfried Wilcke, who describes himself as “an 
extremely happy owner of a Tesla S” electric 
car, credits the vehicle with changing his mind 
about battery-research priorities. 

Five years ago, Wilcke, who heads IBM’s 
nanoscience and technology division in San 
Jose, California, launched a project to develop 
a car battery with an 800-kilometre range. At 
the start, he focused on the theoretical ultimate 
in energy-dense electrochemical storage: the 
oxidation of lithium with oxygen drawn from 
the air. Such ‘breathing’ batteries have a huge 
weight advantage over other types, because they 
do not have to carry around one of their main 
ingredients. A lithium—oxygen (Li-O) battery 
can, in theory, store energy as densely as a pet- 
rol engine — more than ten times better than 
today’s car battery packs. 

But after driving more than 22,000 kilo- 
metres in his electric roadster, Wilcke is happy 
with the 400-kilometre range that its battery 
already provides. The real problem, he says, 
is money: battery packs for electric cars cost 
more than $500 kWh *. “What's holding back 
the mass acceptance of electric cars is really the 
price rather than the energy density,’ he says. So 
Wilcke now favours a cheaper breathing battery 
based on sodium. Theory predicts that sodium- 
oxygen (Na-O) batteries could provide only half 
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the energy density of Li-O, but that is still five 
times better than Li-ion batteries. And sodium 
is cheaper than lithium, so Na—O might, Wilcke 
hopes, get closer to the $100-kWh ' goal that 
the JCESR and others have set for affordability. 

Wilcke’s change of heart was undoubtedly 
influenced by the fact that many have given up 
hope on Li-O. Researchers 
who have tried to make it work 
over the past 20 years have 
wrestled with unwanted side 
reactions: carbon in the elec- 


cannot beat Li-ion on measures such as energy 
stored per unit weight. But when it comes to 
storing energy for the electricity grid — or 
other non-portable applications — size does 
not matter. Instead of a small, light battery that 
packs a powerful punch, what people need is 
a battery that cheaply bottles and releases 


RADICAL REDESIGNS 


Lithium-ion batteries are today’s best choice for portable, rechargeable applications. 
Better batteries could be made by changing the electrodes, the electrolyte or the 
charge-carrying ions. Researchers are also pursuing other designs. 


test batteries in Hawaii and at a military base 
on Cape Cod, Massachusetts, this year, each 
supplying tens of kilowatt-hours. 

Other research groups are pursuing less-rad- 
ical flow batteries, in which the fuel consists of 
two liquids that pass ions to each other through 
a membrane. The liquids can be kept in tanks 
outside the battery and pumped 
in to flow past each other when 
needed, so it is possible to store 
larger amounts of energy indefi- 
nitely simply by using bigger 


trolyte and electrode material 
react with the lithium and oxy- 
gen to form lithium carbonate, 
so that in every cycle, some 
5-10% of the battery capacity 
is lost. After 50 cycles or so, the 
battery suffocates. “The bot- 
tom line is that Li-O has zero 
chance for vehicles,” says Stan- 
ley Whittingham at Bingham- 
ton University in New York, 
who invented the concept of 
Li-ion batteries in the 1970s 
and still focuses on squeez- 
ing the best performance out 
of them. Researchers hoping 
to resuscitate Li-O include 
Peter Bruce, a chemist at the 
University of St Andrews, UK. 
“We are closer to what's needed 
than we were a few years ago,” 
he argues. But many consider 
it a lost cause. 

Wilcke took an interest in 


LITHIUM-ION BATTERY 
A chemical energy gradient drives lithium 
ions through a membrane from a graphite 
electrode to a metal oxide one, causing 
electrons to flow around a closed circuit. 


;- Electrode 


MAGNESIUM-ION BATTERY 

Mg2* Redesigning the electrodes and replacing 
the lithium with heavier ions that carry 
more charge, such as magnesium, could 
double the energy carried per volume. 


tanks. But they do need pumps 
and valves, which Sadoway says 
will require maintenance. 

Commercial flow batteries 
use vanadium ions in the liquid 
on both sides of the barrier. But 
vanadium and the membranes 
are expensive: the world’s largest 
flow battery, installed at a wind 
farm in China, probably costs 
$1,000kWh'*, estimates Huamin 
Zhang at the Chinese Academy 
of Sciences’ Dalian Institute of 
Chemical Physics. “The cost of 
vanadium just kills you,” says 
Michael Aziz, a materials sci- 
entist at Harvard University in 
Cambridge, Massachusetts. 

In January this year, a team 
including Aziz announced’ that 
cheap organic chemicals called 
quinones could be used in a flow 
battery, partnered to a standard 
liquid electrode such as bromine. 


the sodium breathing battery 

last year, following a surprising 

discovery by a team including Jiirgen Janek and 
Philipp Adelhelm at the Justus-Liebig Univer- 
sity of Giessen in Germany. They found that a 
Na-O battery recharges more efficiently than 
Li-O, without complicating side-reactions’. 
“We tried it and were pretty stunned,” says Wil- 
cke. Plus, he says, it works with cheap electrodes 
and electrolytes. Janek says that his team has 
now shown that its battery can work reversibly 
for at least 100 cycles — not bad for the early 
days of the technology. Chemicals giant BASF 
is now working with them. 

Dahn, for one, is not convinced. Debate 
rages about whether breathing batteries will 
require heavy filtering equipment to extract 
oxygen from the air, which would cut down or 
even eliminate their energy-per-weight advan- 
tage. “Na-O is just the latest craze,” says Dahn. 
But Wilcke is willing to bet otherwise. 


fia GO BIG FOR THE GRID 
Donald Sadoway’s vision of the future battery 
looks like a smelting plant: he envisions crates 
the size of shipping containers, each holding 
20 refrigerator-sized steel blocks containing 
litres of molten metals and salts heated to 
500°C. 

Such batteries could never fit in a car, and 
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small-to-large amounts of electricity without 
much maintenance. The JCESR wants such 
batteries to last for 7,000 cycles, or about 20 
years. 

“The field is wide open,’ says Ceder. Grid 
suppliers have used banks of cheap, old- 
fashioned lead-acid batteries, for example, 
or stacks of Li-ion. A dizzying array of other 
chemistries are in development, including 
zinc-air and sodium-ion. Most technologies 
are doing well to cost five times as much as the 
JCESR’s $100-kWh™ target. 

Sadoway, a materials chemist at the Massa- 
chusetts Institute of Technology, is developing 
an alternative with two layers of molten metal 
as electrodes, separated by their different den- 
sities and by a layer of molten-salt electrolyte. 
The metal layers swell or shrink as ions pass 
between them, storing or releasing energy. 
Because everything is liquid, there is nothing 
that could crack after thousands of cycles, as 
solid electrodes might. 

Crabtree, Dahn and other researchers worry 
about the energy needed to keep the compo- 
nents molten. But Sadoway says that the charg- 
ing and discharging processes produce enough 
heat on their own. His company — Ambri in 
Marlborough, Massachusetts — plans to install 
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Aziz has cycled his system more 

than 100 times and it is still run- 
ning strong. He hopes that he can get such bat- 
teries below the magic $100 kWh “, but “this 
is a toy in a fume hood in a laboratory right 
now’, he says. “There is no way to know the 
true cost until you are mass-producing it.” 

Crabtree calls the work “promising” and 
says that the JCESR is also looking at organic 
chemicals for flow batteries. Another option 
it is pursuing is to use liquid Li-S and solid 
lithium in a sort of half-flow battery. 

“Tt’s early days: people are looking at really 
oddball systems, and everyone's trying to 
figure out how to get the lifetime up and 
the costs down,” says Dahn. The JCESR, for 
one, is hoping that basic research can fill in 
the gaps and make these technologies work. 
“The beyond-lithium-ion space is rich with 
opportunity,” says Crabtree, “and mostly 
unexplored.” = 


Richard Van Noorden is a senior reporter for 
Nature in London. 
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The proposed Cherenkov Telescope Array (artist’s impression) might detect light flashes from y-rays produced when dark-matter particles interact. 


Broaden the search for 
dark matter 


Bold strategies are needed to identify the elusive particles that should 
make up most of the Universe’s mass, say Mario Livio and Joe Silk. 


ark matter is living up to its name. 
D In spite of decades of compelling 

evidence from astronomical obser- 
vations showing the existence of matter that 
neither emits nor absorbs electromagnetic 
radiation, all attempts to detect dark matter’s 
constituents have failed. 

The presence of dark matter is inferred 
from its gravitational effects. Stars and gas 
clouds in galaxies and galaxies in clusters 
move faster than can be explained by the 
pull of visible matter alone. Light from dis- 
tant objects may be distorted by the gravity 


of intervening dark material. The pattern of 
large-scale structures across the Universe is 
largely dictated by dark matter. In fact, about 
85% of the Universe's mass is dark, account- 
ing for about one-quarter of the total cosmic 
energy budget. 

Despite its ubiquity, the nature of dark mat- 
ter eludes us. Negative results have flowed 
from searches for candidate particles to 
explain it. In 2013, the Large Underground 
Xenon (LUX) experiment — the most sensi- 
tive detector of its kind — in the Homestake 
Mine in Lead, South Dakota, reported no 
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signs of dark matter in its first three months 
of operation’. The Large Hadron Collider 
(LHC) at CERN, Europe's particle-physics 
laboratory near Geneva, Switzerland, has 
found no evidence for the existence of what 
some think are the most likely culprits: super- 
symmetric particles, theoretically predicted 
partners to the known elementary particles. 

Is there light at the end of this dark tunnel? 
Possibly — but only if searches become 
bolder and broader. More varied particle 
types should be sought. Definitive tests need 
to be devised to rule out some classes of 
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> dark matter and some theories. If dark 
matter remains undiscovered in the next 
decade, then physicists will have to seriously 
reconsider alternative theories of gravity. 


EXOTIC PARTICLES 

We know a little about dark matter’. Because 
it does not absorb light or interact with elec- 
tromagnetic waves, the majority of it cannot 
be made of baryons — particles of ordinary 
matter, including protons and neutrons, 
which are composed of three quarks. And 
dark matter must lie beyond the standard 
model of particle physics to avoid upset- 
ting Big Bang nucleosynthesis, the theory 
of which successfully predicts the observed 
abundances of light elements such as deu- 
terium, helium and lithium arising from 
interactions in the early Universe. 

The main constituents are expected to 
be weakly interacting massive particles 
(WIMPs)’. These particles have masses a few 
tens to thousands of times that of the proton. 
WIMPs interact among themselves and with 
ordinary matter gravitationally and through 
the weak force, but not electromagnetically or 
through the strong nuclear force. To explain 
the way in which galaxies form and cluster, 
dark-matter particles should be relatively 
slow moving, or ‘cold’ If they were faster, and 
could move easily beyond the dimensions of 
a protogalaxy, many structures visible today 
would have been washed out. 

What kind of particles might WIMPs be? 
Many physicists guess the most likely can- 
didates are the lightest of the supersymmet- 
ric (SUSY) particles. Theories of the early 
Universe contain mathematical symmetries 
that allow for each known particle, such as 
the electron, photon or quark, to have a (yet 
undiscovered) massive partner. Just after 
the Big Bang, almost all of the heavy SUSY 
particles decayed or annihilated one another 
— but the lightest SUSY particles, unable to 
decay any further, were stable and survived. 

The number of SUSY particles remain- 
ing depends on their masses and interaction 
strengths, which can be predicted from 
theory. The properties must be just right for 
the relic particle density to match that needed 
to explain the observed effects of dark matter. 
Light SUSY particles fit that bill, motivating a 
plethora of experiments to try to detect them. 
Yet no WIMPs have been seen. 

Finding new particles is challenging. It took 
four decades to spot the Higgs boson. But 
through current and planned experiments we 
should be able to rule out several candidates 
for dark matter in the next decade. The goal is 
to detect the particles that constitute the mas- 
sive halo of dark matter that surrounds the 
Milky Way as they pass through our detectors 
at rates ofa few per second per square metre’. 
Because they interact so weakly, it takes a 
huge effort to capture them. If we are lucky, a 
300-day run of LUX, scheduled to begin later 
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this year, could detect WIMPs. Or it may 
take bigger detectors — and more than one 
method (see ‘What’s the matter?’). 

Some estimate that detectors with 
100 times more mass than LUX will be 
needed to see WIMPs at the rates expected. 
LUX ZEPLIN, a planned upgrade that could 
begin operation around 2019, would use 
7 tonnes of liquid xenon compared with 
LUX’s current 370 kilograms. We may yet 
require 100-tonne detectors, but that is the 
practical limit. Further sensitivity is thwarted 
by an irreducible neutrino background, 
mainly from supernovae, the Sun and cosmic 
rays hitting Earth's atmosphere. 

Dark-matter particles might be created 
in colliders. The LHC is expected to resume 

operation in 2015 at 


“Some an energy of 14 tera- 
theorists have  electronvolts (TeV), 
even started twice the energy that 
to wonder led to the discovery 
whether of the Higgs boson. 
dark matter SUSY particles and 
exists.” signatures of depar- 


ture from the standard 
model might be glimpsed. But the lack of any 
SUSY signals so far suggests that we may need 
much higher energies to see them. A 100-TeV 
collider, which many particle physicists sup- 
port as the next step after the LHC, is slated 
to begin construction around 2020 and would 
be exciting for dark-matter searches’. 


DIFFICULT TO DETECT 

Other dark-matter experiments have made 
intriguing detections, the interpretations of 
which are still debated. The DAMA/LIBRA 
experiment, and its predecessor, at Gran 
Sasso National Laboratory in Italy, have 
been looking for changes in the flux of dark- 
matter particles hitting Earth for 14 years. 
As Earth orbits the Sun and as both travel 
through the Milky Way, the velocities of the 
two bodies combine. For half the year, their 


WHAT'S THE MATTER? 


velocities are in the same direction; for the 
other half, they are in opposition. This pro- 
duces an annual modulation in the rate at 
which the dark-matter particles, travelling 
in random orbits, fall on the detector. 

In 2013, the DAMA/LIBRA team 
reported® such a variation with an accu- 
mulated high statistical significance (more 
than nine sigma). Similar cycles, consist- 
ent in phase but with an amplitude larger 
than expected, were seen (with low statisti- 
cal significance) independently earlier this 
year in just over three years of data from the 
CoGeNT (Coherent Germanium Neutrino 
Technology) Dark Matter Experiment’ in 
Soudan, Minnesota. Yet most physicists 
query whether the DAMA/LIBRA results 
really are because of WIMPs and not some 
other annual phenomenon, such as neu- 
trons leached from the rocks surrounding 
the underground experiment in response to 
seasonal temperature variations. 

Indirect attempts to detect dark matter® 
have been equally inconclusive. The Alpha 
Magnetic Spectrometer (AMS-02) on the 
International Space Station reported’ last 
year an excess of positrons in the cosmic-ray 
spectrum up to 350 gigaelectronvolts (GeV), 
consistent with being produced by dark- 
matter particles colliding and annihilating. 
These results strengthened similar reports 
from the Payload for Antimatter Matter 
Exploration and Light-nuclei Astrophysics 
(PAMELA) satellite. But the positrons could 
have other origins, such as winds from pul- 
sars (rapidly rotating neutron stars). Obser- 
vations with AMS-02 at higher energy in the 
next two years might distinguish between 
these hypotheses. 

Another source of excitement was the 
detection last year with the Fermi Gamma-ray 
Space Telescope of an excess of y-rays near 
the Galactic centre, where dark matter 
should concentrate. A narrow spectral line 
at 130 GeV apparently associated with the 


Dark-matter particles (D), such as weakly interacting massive particles and axions, can be spotted through 
their interactions with various types of standard-model particles (S) or with themselves. Experiments may 
detect them in four ways: directly; indirectly, by the particles such as photons they give off when they 
interact; in colliders; or through astrophysical observations. 
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LUX, Large Underground Xenon experiment; CoGeNT, Coherent Germanium Neutrino Technology Dark Matter Experiment; CTA, Cherenkov 
Telescope Array; HESS, High Energy Stereoscopic System; AMS, Alpha Magnetic Spectrometer; PAMELA, Payload for Antimatter Matter 
Exploration and Light-nuclei Astrophysics; LHC, Large Hadron Collider. *Planned experiment or observations. 
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excess could indicate dark-matter-particle 
annihilations or decays. But a similar line 
from Earth’s atmospheric limb implies that at 
least part of the signal must be instrumental 
in origin. A conclusive test may come in the 
next couple of years from the HESS (High 
Energy Stereoscopic System) y-ray telescope 
in Namibia, which is observing the inner Gal- 
axy in the 100 GeV to 1 TeV energy range. 

The null results from LUX, the LHC and 
many other experiments are narrowing the 
range of possible particles that could explain 
dark matter. As claimed detections pop up 
only to disappear, physicists are becoming 
justifiably sceptical about every announce- 
ment of a discovery. 

Some theorists have even started to won- 
der whether dark matter exists. Since the 
1980s, a few have proposed modifying the 
theory of general relativity to do away with 
the need for dark matter. Such radical ideas 
are increasingly invoked to address another 
grave problem in astrophysics: the origin of 
the ‘dark energy’ that accelerates the expan- 
sion of the Universe. Most researchers think 
that we are far from needing new physical 
laws, especially because experimental aven- 
ues are still open. But unpleasant surprises 
are always possible. 

There are two worst-case scenarios. First, 
dark matter may not comprise one type of 
particle — as many current searches assume 
— but many. Second, the particles might 
interact only gravitationally, and could be 
practically invisible to conventional detectors. 


NEW DIRECTIONS 
Existing experiments should run their course. 
But new approaches are needed to tease out 
dark-matter particles in the next decade. 

A dark-matter modulation experiment, 


The LUX experiment detects photons produced when dark-matter particles interact with liquid xenon. 


such as DAMA/LIBRA or CoGeNT, in the 
Southern Hemisphere would gauge the 
extent of Earth’s seasonal effects, which 
would be out of phase relative to the north. 

Clumps or streams of dark matter mov- 
ing through the Milky Way, distorting the 
rate at which particles hit detectors, should 
be visible as disturbances in the motions of 
the roughly one billion nearby stars that will 
be tracked by the European Space Agency’s 
newly launched GAIA satellite during its 
five-year mission. 

At the LHC and other next-generation 
accelerators, particle collisions with missing 


off electrons in the solar plasma might be 
detectable through helioseismology. Gravi- 
tational lensing could measure the more- 
spherical dark haloes of distant galaxies, 
which are expected if dark-matter particles 
interact electromagnetically, albeit feebly. 

Third is the axion. Predicted to explain an 
anomaly in quantum chromodynamics, the 
theory of the strong force, the electromag- 
netic signatures of axions have been long 
sought in the lab without success. String 
theory suggests types of ultralight axion 
that would be slightly more ‘warm’ than cold 
dark matter. Mixes of cold and warm dark 
matter, perhaps also including neutrinos”, 
might explain, for example, why there are 
fewer dwarf galaxies than cold-dark-matter 
scenarios predict. 

Astrophysicists should look for unusual 
signals in old stars, such as neutron stars 
and white dwarfs. As stars orbit their galaxy, 
they accumulate WIMPs. Collected in the 
core ofa neutron star, WIMPs might form a 
tiny black hole that could eventually devour 
the star, causing a violent explosion — an 
event that has yet to be seen. Helioseismol- 
ogy could also probe the effect of WIMPs on 
the Sun’s temperature profile. 

To refine theoretical and experimental 
strategies, particle physicists and astro- 
physicists need to communicate better. The 
number of dark-matter-candidate particles 
to be explored is limited, bounded at low 
masses by our failure to see anything and 
at high masses by the constraints of theory. 
A multidisciplinary approach to explore 
the 1-100 TeV mass-energy range should 
be the next frontier for the dark-matter 
community. m 


energy — drawn by an unknown particle 
or other unexpected signatures could illumi- 
nate the dark sector. 

We must also broaden directed searches 
and exploit astrophysical methods. First, we 
should look towards more massive particles, 
such as the SUSY particles. It will be diffi- 
cult to detect heavy particles directly because 
there will be fewer of them. But y-ray astron- 
omy may help. The Cherenkov Telescope 
Array — an international project to build 
more than 100 ground-based telescopes to 
capture light flashes from y-rays scattered 
by the atmosphere — should after 2015 
open the window to 100-TeV energies. 
This energy coincides nicely with the high- 
est limit on the WIMP mass expected from 
fundamental physics arguments. Such par- 
ticles would generate TeV y-rays when they 
annihilate or decay. 

Second, broader categories of dark-matter 
particles should be sought. Like ordinary 
matter, dark matter could be complex, 
perhaps carrying a small charge, or having 
internal states akin to the electron levels of 
an atom. Changes in the Sun’s oscillations 
as clouds of ‘millicharged’ particles scatter 
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Domestic goats and sheep can graze marginal lands, such as a in the Gobi Desert in Mongolia. 


Steps to sustainable Movmeaie 


With improved breeding and cultivation, ruminant animals can yield food that is 
better for people and the planet, say Mark C. Eisler, Michael R. F. Lee and colleagues. 


he need for efficient food production 
| has never been greater. One in seven 
humans is undernourished’. Urbani- 
zation and biofuel production are reducing 
land availability, and climate change, lack of 
water and soil degradation are decreasing 
harvests. Over the past decade, cereal yields 
per hectare have fallen in one-quarter of 
countries. Meanwhile, developing nations 
and the growing world population are 
demanding more animal protein. 

The increasing consumption of animal 
protein is generally considered at odds with 
Earth’s ability to feed its people. The 1 billion 
tonnes of wheat, barley, oats, rye, maize 
(corn), sorghum and millet poured annually 
into livestock troughs could feed some 3.5 bil- 
lion humans. But such reasoning discounts 
the health benefits of eating modest amounts 
of meat and the fact that foraging animals can 
consume foods that humans cannot eat. 

Crop and livestock farming complement 
each other’. Half the world’s food comes from 
farms that raise both. Animals pull ploughs 
and carts, and their manure fertilizes crops, 
which supply post-harvest residues to live- 
stock. But efforts to maximize yields of milk 
and meat can disrupt finely balanced systems. 
The quest for ‘intensification’ in livestock 
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farming has thundered ahead with little 
regard for sustainability and overall efficiency 
(the net amount of food produced in terms of 
inputs such as land and water). With animal 
protein set to remain part of the food supply, 
we must pursue sustainable intensification 
and figure out how to keep livestock in ways 
that work best for individuals, communities 
and the planet. 

Almostall of the world’s milk and much of 
its meat come from ruminant (cud-chewing) 
animals — mostly cows, goats and sheep, but 
also buffalo, camels, llamas, reindeer and 
yaks. Here we highlight eight strategies to 
cut the environmental and economic costs 
of keeping these animals while boosting net 
gains for the quantity and quality of the food 
they produce. 


Feed animals less human food. Around 
70% of the grains used by developed coun- 
tries are fed to animals. Livestock consume 
an estimated one-third or more of the world’s 
cereal grain, with 40% of such feed going to 
ruminants, mainly cattle’. 

Some of this is avoidable. Ruminants graze 
pastures and can eat hay, silage and high- 
fibre crop residues that are unsuitable for 
human consumption. Unlike pigs, poultry 
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and humans, ruminants have a series of 
forestomachs leading to the true stomach. In 
the forestomachs, the largest of which is the 
rumen, microbes break down fibrous plant 
material into usable calories and also provide 
high-quality microbial protein. Ruminants 
can graze in marginal areas, such as moun- 
tainsides or low-lying wet grasslands. This 
helps to reserve agricultural fields for grow- 
ing human food. 

Even where large quantities of cereals are 
consumed by ruminants, up to 60% of their 
diet comes from high-fibre feed that humans 
cannot digest. In the European Union, more 
than 95% of milk comes from animals fed 
on grass, hay and silage, supplemented with 
cereals. Cattle in New Zealand’s exemplary 
dairy industry obtain 90% of their overall 
nutrition by grazing pasture’. China's grow- 
ing dairy industry initially relied on imported 
grain and high-quality fibre from the Ameri- 
cas. Ongoing research is showing how best to 
use local crop residues, such as rice straw. 


Raise regionally appropriate animals. The 
lure ofhigh productivity has led to ill-advised 
schemes to import livestock to places where 
they are genetically unsuited. Kerala, a state in 
southern India, is home to the smallest breed 
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of cattle in the world. Vechur cows stand at 
about 90 centimetres tall and make only 
around 3 litres of milk per day — a dribble 
compared to the 30 litres per day produced 
on average by Holsteins, the black-and-white 
dairy cows of Europe and North America. 
Donors, governments and charities aiming 
to feed whole communities, and to provide 
income for poor farmers, have imported 
Holstein breeding stock and semen to Africa 
and Asia, with progeny now numbering in 
the millions. But the animals often disap- 
point. Bred for centuries for maximum milk 
production in temperate climates, these cows 
were not selected for fertility or hardiness. 
They lack resistance to heat, humidity, tropi- 
cal diseases and parasites, and so must be kept 
in stalls away from ticks and other disease 
vectors. Rather than allowing the animals out 
to pasture, farmers in tropical areas must cut 
and carry fodder to the animals or purchase 
expensive, often imported, feed. Even then, 
the cows produce less than one-third of yields 
seen in temperate climates and controlled 
environments. For poor families, a smaller 
native cow is a better bet than a larger animal 
that costs more to keep alive and healthy. 
Similarly, breeds of cattle usually farmed 
in the humid tropics of West Africa have 
developed resistance to the debilitating dis- 
ease trypanosomiasis over several thousand 
years of exposure to the tsetse fly that carries 
it. In the hope of greater profits and wealth, 
farmers often replace these animals with 
larger European cattle, or with zebu breeds 
from areas north of the tsetse belts. The zebu 
breeds are less resistant to trypanosomiasis, 
and European cattle have no resistance. 
The expense of drugs to combat the disease 


CUD CHEWERS 


often outweighs the increase in income. 
More can and must be done to encourage 
farmers to realize the advantages of livestock 
adapted to local areas. Cutting-edge genomics 
could guide selective breeding to boost pro- 
duction of animals that are already adapted to 
their climates and resistant to local diseases. 


Keep animals healthy. Sick animals can 
make people sick. In low- and middle- 
income nations, 13 livestock-related zoon- 
oses (diseases that can infect humans and 
animals) cause 2.4 billion cases of human 
illness and 2.2 million deaths each year*. 
Yet human and livestock disease are gener- 
ally treated as separate problems. Animal 
management should include measures to 
contain transmissible diseases, for example, 
by improving hygiene, quarantining new 
arrivals on farms and establishing coordi- 
nated, sustained surveillance for diseases that 
cross the boundaries of species or countries. 

Mismanagement and poor welfare ren- 
der animals particularly susceptible to 
parasites and disease. Many young animals 
die of disease before they can lactate, reach 
slaughter weight or reproduce. This lowers 
yields, increases environmental impacts and 
decreases farmers’ ability to select the best 
breeding stock. With education and some 
financial aid, farmers could improve hus- 
bandry, and more animals would survive to 
become productive. 

Keeping animals at high densities spreads 
infectious diseases far and fast. The foot-and- 
mouth virus costs upwards of US$5 billion 
each year in vaccinations and lost production 
worldwide. A UK epidemic in 2001 resulted 
in the slaughter of 6 million animals. Bovine 


Numbers of domesticated ruminant animals are falling in developed countries and rising in developing ones. 
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tuberculosis has cost UK taxpayers alone 
£500 million (US$830 million) over the past 
decade — an amount projected to double in 
the next ten years. Market disruptions and 
losses are felt across industries including 

agriculture, transportation and tourism. 
European Union law holds farmers respon- 
sible for human health and food-safety issues 
following the slaughter of their animals. 
Growing awareness of problems such as anti- 
biotic resistance has led to approaches that 
rely less on anti-infective drugs and more 
on management practices, such as reducing 
overcrowding. Sim- 


“The focus ple decision-support 
should be on tools are emerging to 
eating less, help farmers to treat 
better quality affected individu- 
meat.” als rather than entire 


herds, and to keep 
animals away from risky pastures or other 
sources of infection’. Gathering local evi- 
dence can confirm the benefits of such strat- 
egies and encourage farmers to adopt them. 


Adopt smart supplements. The productivity 
of ruminant animals can often be boosted 
with supplements, some of which encourage 
microbes in the rumen to grow quickly and 
to provide better nutrition. In India, a water 
fern (Azolla caroliniana) cultivated in local 
ponds provides extra protein to cattle and 
goats fed on protein-deficient elephant grass 
(Pennisetum purpureum). 

Other plant extracts can alter the rumen 
microbial population to use nitrogen and 
energy more efficiently. This means produc- 
ing more meat and milk with proportionally 
less by-product greenhouse gas and ammo- 
nia. An enzyme in red clover (Trifolium 
pratense), widely grown in temperate coun- 
tries, increases ruminants’ ability to utilize 
dietary protein®. In field trials, dairy cows 
with more clover in their diets ate more feed 
and produced more milk. In Australia, sheep 
nibble on the deep-rooted perennial tar bush 
(Eremophila glabra) during dry autumns 
when most other pasture plants offer poor 
food value. Tar bush combats gastrointestinal 
nematodes and acidosis, and reduces emis- 
sions of methane, a greenhouse gas 25 times 
more potent than carbon dioxide’. 

Governments and policy-makers should 
support research efforts to identify the most 
beneficial microbes and most limiting nutri- 
ents, as well as low-cost ways to deliver them. 


Eat quality not quantity. Annual meat con- 
sumption in India is just 3.2 kilograms per 
person, compared with 125kg per person in 
the United States in 2007, much of it from 
heavily processed foods, such as burgers, 
sausages and ready meals. The focus should 
be on eating less, better quality meat. In rich 
countries, the high quantity and low qual- 
ity of processed animal products consumed 
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Unlike Vechur cows (left), Holstein cattle (right) have little resistance to heat, humidity and tropical diseases, and are most productive in controlled environments. 


contributes to ill health, with higher rates 
of cancer and coronary heart disease. For 
the world’s poor people, however, there are 
clear nutritional advantages to consuming 
small amounts of high-quality animal foods, 
which are rich in protein, essential amino 
acids, iron and various essential micronutri- 
ents that improve chances for normal physi- 
cal and cognitive development’. 

The public-health goal, therefore, should 
be to balance nutrition across the world, with 
a target of weekly average consumption of red 
meat of no more than 300 grams. Trends are 
in the right direction; numbers of ruminants 
in the developed world have fallen over the 
last two decades (see ‘Cud chewers’). 


Tailor practices to local culture. Close to 
one billion of the world’s poorest people rely 
on livestock for their livelihood. Traditional 
animal husbandry supplies more than just 
food’. Keeping animals provides wealth, sta- 
tus and even dowry payments. When families 
encounter large expenses, such as a hospital 
bill or a wedding, they can sell an animal or 
two to cover the cost. Many of these benefits 
are disrupted when conventional grazing and 
mixed-farming practices are replaced with 
industrial systems that maximize short-term 
production. Policies to encourage humane, 
efficient management should consider cul- 
tural as well as natural factors. For instance, 
in traditional herding communities in the 
Horn of Africa, philanthropic efforts to sup- 
port the cattle trade have led to larger herds 
for wealthier individuals, with little evidence 
that they have benefited poorer pastoralists. 


Track costs and benefits. Livestock are 
widely considered to be unsustainable. 
The livestock sector accounts for 14.5% of 
human-induced greenhouse-gas emissions, 
exceeding that from transportation. 
However, if other factors are consid- 
ered, the picture becomes more favourable. 
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Sustainably managed grazing can increase 
biodiversity, maintain ecosystem services and 
improve carbon capture by plants and soil’®. A 
cow produces up to 70kg of manure per day, 
providing enough fertilizer in a year for one 
hectare of wheat, equivalent to 128kg of syn- 
thetic nitrogen that might otherwise derive 
from fossil fuels. Mechanized arable agricul- 
ture and food processing themselves produce 
greenhouse gases, and costs of switching are 
exacerbated if meat’s nutritional advantages 
are considered”. Farm animals also provide 
hides, wool, traction and biogas, a fuel pro- 
duced from manure. 

Calculating how this balances out is 
hard, but essential. Life-cycle assessment 
data should be used to tune livestock poli- 
cies to socioeconomic and geographic 
environments. 


Study best practice. To explore the multi- 
disciplinary strategies described here, we are 
building a global network of research farms. 
Three such ‘farm platforms’ are operational. 
Two focus on the use of naturally adapted 
livestock and native plants: the University of 
Western Australia Future Farm in Pingelly, 
which has Mediterranean biome conditions 
and where water conservation is crucial, and 
the Thiruvazhamkunnu Livestock Research 
Station in Kerala, India, which has humid 
tropical conditions and where grazing is 
strictly limited. At the third, Rothamsted 
Research North Wyke Farm Platform in 
Devon, UK, cattle and sheep graze in temper- 
ate grassland conditions on three hydrologi- 
cally isolated, 22-hectare ‘farmlets’ to compare 
nutrient cycling and productivity under vari- 
ous pasture-management strategies. There are 
plans to establish further platforms in South 
America, North America and China. 

There will be no one-size-fits-all solutions. 
Changing farming practices is difficult, but 
farm platforms can evaluate potential for 
increased profits and other benefits, act as 
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examples to follow, and provide information 
for policy-makers. We hope to identify bet- 
ter practices to optimize the use of livestock 
in different regions, using local resources, 
breeds and feedstuffs — and produce tangible 
evidence to convince local farmers. = 
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The cryptic mind 


Li Gong ponders a translation of a Chinese novel on the world of code-breaking. 


Bestselling novelist Mai Jia is alleged to have worked in cryptography. 
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tradictions of crypto- 

graphy. However implausible-sounding 
the story of Mai Jia’s breakthrough 2002 
novel Jie Mi (Decoded), this is a spy thriller 
grounded in subtle and difficult realities. 
Mai — one of China’s pre-eminent writers 
— is alleged to have worked in cryptography 
himself. The battle in the reader’s mind 
over whether this is florid drama or brutal 
realism is perhaps the primary and primal 
attraction of the novel, translated into Eng- 
lish for the first time by Olivia Milburn and 
Christopher Payne. 

Decoded is written in the style of a film 
script, with rapid cutting between scenes 
and the reader thoroughly gripped by the 
characters’ tribulations. It is also neces- 
sarily dressed up with copious references 
to mathematics — names of mathemati- 
cians, theorems, and encoding and decod- 
ing methods — and even includes a few 
formulae. Yet its main theme is human 
psychology, and especially the tortured 
psyches of those who, like protagonist 
Rong Jinzhen, spend their lives trying 
either to hide information or to crack 
the protective puzzles of their enemies. 
Rong’s old tutor, for instance, writes to his 
former protégé, condemning decryption 
and cipher construction as “fundamen- 
tally anti-scientific, anti-intellectual ... 
a poison that mankind has developed to 
destroy science and a conspiracy against 
the people that work with them”. 

The translation is good overall, but there 
are a few fatal errors. For example, Mai takes 
a swipe at Chinese higher education, point- 
ing out that a major problem with Chinese 
academics is that they start out as scholars 
and end their careers as government offi- 
cials; he hints that they regard this path as 
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a most desirable and natural progression. 
As the translation has it, however, Chinese 
academics regard scholarship and official- 
dom as incompatible — the exact opposite 
of Mai’s meaning. 

Another lapse undermines a central 
tenet of the novel: the supposed accepted 
wisdom in cryptography that one person 
can only design, or crack, one good cipher 
at most. This is because cryptographers can 
be made vulnerable both by subconscious 
similarities in their own ciphers, and by the 
exposure of their specific code-breaking 
strengths when they crack other people’s 
ciphers. Yet the English version translates 
this as it being impossible for one person 
to be both a cipher designer and a code- 
breaker. As a result, readers might not fully 
understand the tragedy of the story — that 
aman who successfully broke the enemy’s 
sophisticated Code Purple goes mad at fail- 
ing to break its successor. 

With this and subsequent novels such as 
the 2003 An Suan (In the Dark, forthcoming 
from Penguin) selling in their millions and 
being adapted into high-profile television 
series, Mai has firmly established himself 
as the father of modern Chinese espionage 
thrillers. Critical acclaim for his work has 
included China’s most prestigious book 
award, the Mao Dun Literature Prize. 

So is Mai — the supreme storyteller of 
psychological warfare, intrigue and the 

human sufferings of 


“Decoded alienation — a brave 
is dressed writer who unveils 
up with national security 
references to programmes and 
mathematics, pinpoints untold 
yetits main human sacrifices in 
themeis a secretive state? Or 
human is he a commercial 
psychology.” master who plays on 


the public’s desire to 
peer into a covered-up world in a country 
where media is officially controlled and 
pumps out content-free content every day? 
Is he exploiting the public’s wish to believe 
that an outrageous, even over-dramatized 
story is the best vehicle through which to 
tell an untellable truth? 

Even publishers had difficulty answering 
this question. As Mai relates in an appendix 
to a 2011 Chinese edition, he first submitted 
the manuscript of Decoded to two friendly 
publishing houses. Both editors quickly 
rejected his work. One thought the story 
too fake to attract interest; the other felt it 
was too real and sensitive, and could cause 
trouble for the publisher. Decoding Mai Jia’s 
real intentions is perhaps the book’s most 
perplexing challenge. m 


Li Gong is President of Asia Operations at 
Mozilla Corporation in Beijing. 
e-mail: lgong@mozilla.com 
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The Extreme Life of the Sea 

Stephen R. Palumbi and Anthony R. Palumbi PRINCETON UNIVERSITY 
Press (2014) 

Marine biologist Stephen R. Palumbi and writer Anthony R. Palumbi 
survey an impressive catch of extreme oceanic species, from the 
oldest to the deepest-dwelling. They are inspired guides, weaving 
evolutionary and geological backstories into accounts of wonders 
such as the exquisite architecture of sharks’ teeth. And they pull 

no punches in depicting potential futures with devastated oceans 
dominated by “bacteria, jellyfish and tar-like algae”. A brilliant use of 
the rich store of research into Earth’s largest habitat. 


Operation Paperclip: The Secret Intelligence Program that 
Brought Nazi Scientists to America 

Annie Jacobsen LITTLE, BROWN (2014) 

An ethical murk can hang over science in service to the state, as Annie 
Jacobsen reveals in this chilling history. Operation Paperclip united 
Nazi science and US cold-war interests, bringing 1,600 German 
technologists to the United States after the Second World War to 
work on intelligence and weapons research, despite protests from the 
likes of Einstein. Jacobsen focuses on 21 scientists, including rocket 
pioneer and Third Reich insider Wernher von Braun, and reveals 
disturbing evidence of Paperclip’s legacy in US science and medicine. 


Cancer Virus: The story of Epstein-Barr Virus 

Dorothy H. Crawford, Alan Rickinson and Ingolfur Johannessen 

OXFORD UNIVERSITY PRESS (2014) 

Fifty years ago, the discovery of a virus that triggers cancer in humans 
rocked the medical world. In this pithy, pacy study, the Epstein-Barr 
virus is biographized by three scientists who work on it — Dorothy H. 
Crawford, Alan Rickinson and Ingélfur Johannessen. Starting with 
the discovery of Burkitt’s lymphoma in the 1950s and the isolation 
of the virus by Michael Anthony Epstein and Yvonne Barr in 1964, 
they follow its trail to effects in China and the labs where it has proven 
research gold for molecular biologists and geneticists. 


The Wives of Los Alamos: A Novel 

TaraShea Nesbit BLOOMSBURY (2014) 

Behind the men behind the Los Alamos nuclear-research programme 
were women whose story has been waiting to be told. TaraShea Nesbit 
has done it lyrically in this novel. Written in the collective voice of “the 
wives” — international, often highly educated women — this chronicle 
of the Manhattan Project’s secret wartime base in New Mexico unfurls 
as they lived it, distorted by necessary lies. Their strange existence as 
housewives, “calculators” or lab technicians forms a vivid foreground 
to the building of the bombs that finished the Second World War, with 
Nesbit deftly capturing the claustrophobic surreality of it all. 


Ha!: The Science of When We Laugh and Why 

Scott Weems BASIC Books (2014) 

Mirth, points out cognitive neuroscientist Scott Weems, is still 
something of a conundrum — but one well worth cracking. His 
journey through the jovial looks in turn at what it is, what it is for 
and why we should cultivate it. We encounter British psychologist 
Richard Wiseman’s LaughLab and its findings (Americans laugh 
at insults; Europeans savour the absurd), the power of conflict 
and messy thinking, the speculation that atheists are funnier, the 
beneficial impact of laughter on pain tolerance, and more. 
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Migrant nation 


Ewen Callaway revels in an exhibition chronicling the 
ebb and flow of early humanity over the British Isles. 


espite what some politicians say in 
D these inward-looking times, Britain 

is and always has been a nation of 
immigrants. Even the palest Brits descend 
from Middle Eastern stock or other waves 
of foreigners, beginning with the arrival of 
agriculture. Britain: One Million Years of 
the Human Story — an enlightening and 
engaging exhibition at the Natural History 
Museum (NHM) in London — lays out the 
evidence for an immensely long history 
of migration, ebbing and flowing over the 
millennia. 

The exhibition is the fruit of a 13-year 
research project led by the NHM, the Ancient 
Human Occupation of Britain. It shows that 
Britain hosted ten distinct occupations 
of various hominin species (described as 
humans throughout the exhibition) between 
around 1 million and 12,000 years ago. “Nine 
of them died out, and we are the tenth,’ says 
Chris Stringer, research leader in human ori- 
gins at the NHM. 

For many, the show’s biggest appeal will 
be the geography, not the genealogy. Those 
900,000-year-old hand-axes once used to 
butcher big game? They were found near 
a wide sandy beach in Norfolk, popular 
with dog-walkers. That 200,000-year-old 
hippopotamus skeleton? Pigeons now 
pace Trafalgar Square in the heart of Lon- 
don, where it was dug up. For an exhibition 
about the human past, there are surprisingly 
few human bones on display. This is not a 
curatorial oversight, but a consequence of 
the dearth of human fossils in Britain. The 
show may be better for it. Because it is forced 
to rely on evidence such as stone tools and 
marked animal bones, it demonstrates 
how researchers make inferences about 
the people who left such artefacts 
behind. 

Take the first humans to inhabit 
Britain, more than 900,000 years 
ago. An intriguing find revealed just 
last month shows that they may have 
belonged to the species Homo ante- 
cessor, first discovered at Atapuerca 
in northern Spain. A team including 
Stringer found footprints embedded 
in stone near Happis- 
burgh on the Norfolk 
coast, and speculated 
that the appearance 
and the gait of the 
humans who made 


Neanderthals 
(reconstruction) 
migrated to the 
British Isles at least 
twice. 


Britain: One 
Million Years of the 
Human Story 
Natural History 
Museum, London. 


them would have been 
similar to those of 
H. antecessor. A short 
documentary film 


and photo display of} Unti 28 September 
the footprints — now 2014. 

eroded away by the sea 

— are shown. 


The next humans to call Britain home 
may have been Homo heidelbergensis, known 
from fossils in Germany and elsewhere. This 
exhibit points to large flint hand-axes found 
alongside the butchered remains of a rhi- 
noceros near Boxgrove, West Sussex. A film 
re-enacting the grisly slaughter threatens to 
overshadow a fascinating exhibit nearby: 
electron micrographs of the scored rhino 
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bones, which helped to show that the dam- 
age was caused by human butchery rather 
than environmental weathering. 

Neanderthals, the big-bodied hunters 
who lived in Europe and western Asia, 
came as least twice to the British Isles, which 
were intermittently connected to continen- 
tal Europe. The Neanderthals’ first stretch 
in Britain, 400,000-200,000 years ago, was 
interrupted by a harsh cold snap that made 
the area uninhabitable for humans. The rise 
of waters with rapid warming then turned 
Britain into an island, preventing conti- 
nental Neanderthals from returning until 
alternative routes opened up. Their second 
visit, starting less than 100,000 years ago, 
may have been interrupted by anatomically 
modern humans who arrived from Africa 
some 60,000 years later. 

Many details of the relationship between 
Neanderthals and modern humans stand 
on scientifically thin ice, but the exhibi- 
tion mostly avoids just-so stories (Did they 
battle? Did they breed?). Instead it presents a 
nuanced hypothesis, influenced by Stringer’s 
thinking, that Neanderthals could not or did 
not adapt to changing climate and competi- 
tion for resources, and slowly died out. To 
illustrate this possibility, an array of Nean- 
derthal tools, including hand-axes and stone 
blades, sits beside implements fashioned by 
Homo sapiens. These tend to be lighter, more 
varied and made using more efficient tech- 
niques. Early humans went on to develop 
even more complex tools and art, and to 
conduct elaborate burials. 

Weather seems to have had a dispropor- 
tionate role in British life from the start. The 
exhibition explains how the rise and fall of 
migrants was closely tied to climate change. 
Displays detailing the plants and animals 
that ancient Brits encountered drive this 
message home: one is an empty room, rep- 
resenting a period 450,000 years ago, when 

Britain was iced over and uninhabitable. 
Even weary museum-goers enticed 
% by the empty benches will soon 
be chased off by a soundtrack 
of howling winds. 

The discoveries un- 
earthed by the Ancient 
Human Occupation 
of Britain project have 
twice pushed back the 
date of Britain’s earli- 
est human occupa- 
tion. “With further 
work there could be 
even older evidence 
of humans in Britain,” 
Stringer says. Revel in it: 
yet more immigrants. = 


Ewen Callaway is a 
senior news reporter for 
Nature in London. 
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NICK HIGGINS 


Santiago Calatrava’s auditorium in Tenerife, Spain: tinted concrete flecked with white ceramic. 


Andrea Hamilton 
Concrete conservator 


Materials scientist and engineer Andrea Hamilton at the University of Strathclyde in Glasgow, 
UK, uses chemistry research to conserve the structural and aesthetic integrity of concrete. She 
talks about dams, nuclear-waste storage and an artwork that explores weathering. 


How did you 

get interested 

in materials 
engineering? 

I have a very emo- 
tional response 
to architecture. 
Cement and con- 
crete can be seen 
as synonymous 
with poverty and degradation, even war. But 
when I started to study it, I gained a greater 
aesthetic appreciation. 


What are some examples of beautiful 
concrete structures? 

The Catalan architect Antoni Gaudi used 
cement and concrete in novel ways. Rather 
than modular cookie-cutter structures, you 
see curvaceous, colourful forms because pre- 
cast, reinforced concrete allowed him to push 
the boundaries of structural engineering. Sty- 
listically, my favourite is an auditorium by 
Santiago Calatrava in Tenerife, Spain. Made 
from steel-reinforced concrete, it looks like a 
huge wave, curving downward into a narrow 
point, and its 3,500-tonne weight is supported 
by just five points of contact. The surface is 
reflective; Calatrava tinted the concrete using 


titanium dioxide and decorated the surface 
with fine pieces of white ceramic. 


Why is concrete so hard to conserve? 
Cement is chemically complex — more so 
than any other building material. Portland 
cement, the most common type, is made from 
clay and limestone, heated to around 1,450°C. 
This forms clinker, containing at least 70% 
calcium silicate minerals, which is ground 
and mixed with a small amount of gypsum 
to create cement powder. Mixed with water 
and aggregate, that makes concrete. There are 
lots of variables involved, particularly in mix 
design. I have worked on the conservation of 
cement-rendered brick buildings at the East 
Fortune airfield in Scotland. Built during the 
Second World War, they were not intended 
to last. Frost damage has made the cement 
render peel. 


How has ancient concrete survived? 

The Romans must have experimented. They 
used lime mixed with volcanic ash, producing 
a relatively aluminium-rich system. Recent 
research has shown that Al-tobermorite [a 
calcium silicate hydrate mineral] formed in 
ancient harbour concrete such as a break- 
water in Pozzuoli Bay near Naples. This 
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enhanced the concrete’s durability, even in 
sea water. The Romans also added animal 
hair as fibre reinforcements, and blood for 
frost resistance — both excellent innovations. 


How can we make concrete more 
environmentally friendly? 

Every tonne of Portland cement produced 
generates about 1 tonne of carbon dioxide, 
half of it from the chemical reaction when you 
heat limestone. You can't avoid carbon diox- 
ide coming off when you heat limestone, but 
you can control how much Portland cement 
goes into a mix. To me, the most promising 
research on the ecological side focuses on 
incorporating more industrial by-products. 
Up to 80% of the cement can be replaced with 
fly ash or slag, although it takes longer to set. 


How can science aid concrete’s longevity? 
Early research explored the bulk mechanical 
properties of concrete. Now research tends 
to focus on microstructures and crystallog- 
raphy to tailor clinker reactivity. Controlling 
permeability is also essential, as one of the 
main problems associated with concrete and 
cement is water ingress. With colleagues at 
Strathclyde and the Nuclear Decommission- 
ing Authority, I’m working on structures to 
house nuclear waste. One member of the 
project is developing smart sensors that can 
be embedded in concrete to detect proper- 
ties such as water and chloride content, and 
temperature, in real time. My goal is to use 
this data to develop and feed a real reactive 
transport model of concrete deterioration. 


Do concrete-rich countries such as the 
United States present particular challenges? 
The dramatic feature of Frank Lloyd Wright's 
iconic 1930s house Fallingwater is reinforced 
concrete ‘trays’ cantilevered out over a water- 
fall; in the late 1980s the cantilevers had to be 
supported from within because they were sag- 
ging. Of US infrastructure, dams are an issue: 
their average age is 52 years; it could cost more 
than US$50 billion to rehabilitate them all. 


Are you working on any art projects? 

I’m involved in Salt Licks, a fascinating 
project due to be built this summer on the 
Lincolnshire coast. It will be a 6.6-metre- 
high reinforced concrete construction, clad 
with salt blocks. The idea is that the salt will 
dissolve, allowing a dynamic shape to form. 
This represents a fantastically severe test of 
how concrete weathers in contact with salt. 
Time-lapse digital cameras will monitor it. 
We will see the immediate impact of driv- 
ing rainfall, in patterns that have never been 
visible before. My interest is in informing 
new art. Once you have a bag of cement, 
aggregate and water plus guidelines, you can 
create almost anything, anywhere. m 
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Synthetic ivory fails 
to stop illegal trade 


Synthetic ivory can now be 
crafted to the same diagnostic 
standards as genuine ivory 
(see M. E. Sims et al. Ethnobiol. 
Lett. 2, 40-44; 2011), and its 
price in China is only about 
14% of that of real ivory. First 
manufactured in 1865 to save 
elephants’ tusks from being 
turned into billiard balls (US 
patent 50359), synthetic ivory 
is not proving to be the panacea 
hoped for by conservationists 
and Chinese enforcement 
agencies. 

Because synthetic and 
authentic ivory are so similar, 
unscrupulous traders caught 
smuggling illegal ivory can claim 
that it is synthetic; they can 
also pass off synthetic ivory as 
genuine when they sell it. The 
situation may be aggravated by 
legitimate traders, because they 
are entitled to compensation if 
destructive sampling is carried 
out to conclusively distinguish 
real from synthetic ivory 
(H. G. M. Edwards and D. W. 
Farwell Spectrochim. Acta A 51, 
2073-2081; 1995). 

One of us (Z.-M. Z.), as 
an enforcement officer for 
the Yunnan Public Security 
Bureau for Forests in China, 
has investigated 57 cases 
of suspected illegal ivory 
trading since 2011. Of these, 

27 attempted to disguise samples 
of genuine ivory by mixing them 
with fake ivory, and only 513 

of 1,714 items actually proved 

to be synthetic. These litigants 
all voluntarily submitted 

their products to destructive 
sampling, saying that they were 
deceived by their own suppliers. 

The illegal ivory trade does 
not seem to be diminishing: 
in November 2013 alone, for 
example, customs agents in 
Xiamen, China, seized around 
12,000 kilograms of imported 
ivory, and three Chinese citizens 
were arrested for smuggling 
1,800 kg of ivory from Tanzania. 
Zhao-Min Zhou* Yunnan 
Public Security Bureau for 


Forests, Kunming, China. 
zhouzm81@gmail.com 

*On behalf of 4 co-signatories; see 
go.nature.com/prbm3o for full list. 


GDP: signposting a 
false choice 


Robert Costanza and colleagues 
suggest that it is time to abandon 
gross domestic product (GDP) 
as a measure of national success 
(Nature 505, 283-285; 2014). 
However, I feel that your choice 
ofan illustration ofa signpost 

to make this point conveys the 
wrong message — namely, that we 
must decide between economic 
growth and sustainable progress. 

The authors’ point is that 
GDP growth does not equate to 
economic growth: it isa measure 
of currency transactions but not 
of net value creation. Economic 
growth, properly considered, is 
an increase in net value produced 
and a rise in capital assets that can 
be used for value production. 

Modern scientific thinking 
classes natural ecosystems as 
capital assets, so we should 
include the protection of 
ecosystems as value production 
and the destruction of 
ecosystems as value depletion. 
Likewise, improvements in 
social systems and general social 
well-being create value. 

The genuine progress indicator 
(GPI) incorporates economic 
growth as defined in these terms, 
so there is no dichotomy: the 
signposts should be pointing in 
the same direction. 

Bryan Long Palo Alto, 
California, USA. 
bryan@econosystemics.com 


GDP: no one metric 
can rule them all 


Robert Costanza and colleagues 
call for the development of an 
alternative metric that includes a 
broader set of information than 
gross domestic product (GDP) 
and better reflects human well- 
being (Nature 505, 283-285; 
2014). Improved metrics are 
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needed, but we believe that more 
understanding is gained from 
balancing many related metrics. 
The authors tell a common 
story of too much reliance on 
a single metric and too little 
attention to its shortcomings 
as it shapes our world view 
and influences behaviour. For 
instance, current US political 
discussion is heavily focused 
ona ‘jobs’ metric, which also 
conveys an incomplete, and 
often misleading, message. As 
with GDP, this metric does not 
distinguish between activities 
that help or hurt well-being, and 
it does not offer much insight into 
social health and sustainability. 
Individually, metrics can 
mislead and misdirect; in a set 
of complementary metrics, 
they can provide invaluable 
knowledge. Composite metrics 
are useful, but embody weighted 
components that impose and 
conceal trade-offs that should be 
understood and discussed. 
Ironically, as we increasingly 
understand the complexities of 
how the world works, policy- 
makers and the public want to 
distil this complexity into a single 
metric. Improving a metric is a 
step forwards; improving our 
use of metrics would be a leap 
forwards. 
Gregg Marland, Todd L. 
Cherry, Howard S. Neufeld 
Appalachian State University, 
Boone, North Carolina, USA. 
marlandg@appstate.edu 


Addiction: many 
factors contribute 


Your assertion that drug 
addiction is a brain disease 
(Nature 506, 5; 2014) isa 
contentious oversimplification 
— akin to always blaming a car 
crash on the car alone. 

By this analogy, addiction is 
the car crash and the brain is 
the vehicle: the person driving 
it contributes a psychological 
component, the others on the 
road represent social factors, and 
road conditions correspond to 
the environment. To promote any 
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one of these as the prime cause of 
addiction, as in the brain-disease 
model, overlooks the importance 
of the other components. 

There is a risk that such a 
reductionist biological view 
could hinder the development 
of solutions for addiction. 

It could also undermine a 
considered response to the issue 
of politically driven bans on 

the use of animals in addiction 
research — itself'a topic that has 
important social, psychological 
and environmental contributors. 
John A. Cunningham 
Australian National University, 
Canberra, Australia. 
john.cunningham@anu.edu.au 


Addiction: not just 
brain malfunction 


Irrespective of the animal-rights 
issues you discuss, we disagree 
with your one-dimensional view 
that addiction is a disease, and 
with your claim that this view 
is not particularly controversial 
among scientists (Nature 506, 
5; 2014). Neuroscience has been 
widely documented as just one 
of many important influences in 
drug addiction. 
Substance abuse cannot 
be divorced from its social, 
psychological, cultural, political, 
legal and environmental contexts: 
it is not simply a consequence 
of brain malfunction. Such a 
myopic perspective undermines 
the enormous impact people's 
circumstances and choices 
have on addictive behaviour. It 
trivializes the thoughts, emotions 
and behaviours of current and 
former addicts. It also belittles 
the significant effect that public- 
health campaigns and legislation 
exert on curbing substance abuse. 
Addiction is too complex 
to be fought on a medical- 
research front alone. A variety 
of approaches based on diverse 
levels of analysis is required. 
Derek Heim* Edge Hill 
University, Ormskirk, UK. 
heimd@edgehill.ac.uk 
*On behalf of 94 co-signatories; see 
go.nature.com/ehujzh for full list. 
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Missing link in mantle dynamics 


The discovery of crystallographic imperfections known as disclinations in the most profuse mineral in Earth’s upper 
mantle has the potential to solve a problem that has vexed mineral physicists for decades. SEE ARTICLE P.51 


GREG HIRTH 


r Vhe viscosity of Earth’s upper mantle 
controls a wide range of processes, from 
the attenuation of seismic waves and the 

rate of surface deformation after earthquakes 

to the slow, global-scale flow that is associated 
with mantle convection and the dynamics of 
tectonic plates. This viscosity is logically inter- 
preted to be dominated by the physical prop- 
erties of olivine, the most abundant mineral 
in Earth’s upper mantle, as well as in those of 
the other terrestrial planets (Mars, Venus and 

Mercury) and the Moon. In this issue, Cordier 

et al. (page 51) report how new techniques to 

analyse the microstructure of grain bounda- 
ries in olivine allowed them to discover crystal 
defects called disclinations in this mineral. 

This observation is probably a first for geo- 

logical materials, and has ramifications for 

our understanding of the processes that 
control mantle dynamics. 

More than 50 years of effort have gone into 
the experimental and theoretical characteriza- 
tion of the solid-state flow properties of oli- 
vine. Motivated by the principles of materials 
science, a sizeable fraction of these studies has 
concentrated on understanding the relation- 
ships between the mobility of crystal defects, 
primarily dislocations, and the creep behaviour 
(deformation that occurs under continued 
stress) of both olivine single crystals and olivine 
aggregates, which are known as peridotites. 

Owing to the scales of deformation in the 
Earth, the results of experiments on olivine 
must be extrapolated by many orders of magni- 
tude in spatial scale, timescale or stress. None- 
theless, excellent agreement exists between 
predictions based on such extrapolation and 
independent geophysical observations. Fur- 
thermore, the details of microstructures in 
mantle samples collected from mountain belts 
and xenoliths (pieces of rock from great depth 
that are brought to the surface in volcanic 
eruptions) show many striking similarities 
to those observed in experimental samples, 
providing a first-order validation for the 
extrapolation. However, the application 
of these relationships to mantle dynamics 
remains compromised because a fundamental 
principle for grain-scale deformation of aggre- 
gates, the Von Mises criterion, has been hard to 
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Figure 1 | Imaging polycrystalline olivine. Cordier et al.' have analysed images produced using electron 
backscatter diffraction to identify grain-boundary disclinations in olivine aggregates. Shown here is a 
cross-section of a sample deformed at high stress. Different colours denote different crystallographic 
orientations of the crystals that make up the aggregate. Scale bar, 20 um. 


reconcile with these laboratory experiments. 
The Von Mises criterion states that five 
independent slip systems — combinations of 
crystal planes and directions in which defects 
move — are required to allow homogeneous 
grain-scale deformation of polycrystalline 
aggregates; this requirement can be relaxed to 
four slip systems for inhomogeneous grain- 
scale deformation’. However, extensive work 
on olivine single crystals suggests that only 
three independent slip systems exist for rea- 
sonable stress states in both laboratory and 
natural conditions. This has led to sugges- 
tions that deformation accommodated by the 
diffusion of mineral components through a 
process called dislocation climb, or through 
grain-boundary sliding**, might account for 
the ‘missing’ grain-scale deformation required 
by the Von Mises criterion. These hypotheses 
are challenged by the extremely sluggish dif- 
fusion kinetics observed for olivine, and the 
limited range of grain size where evidence 
for grain-boundary sliding is observed. 
Now Cordier et al. suggest that the motion 
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of grain-boundary disclinations could be a 
solution to the problem. 

As noted by Cordier and colleagues, the 
idea of disclinations was first described’ more 
than 100 years ago. However, awareness of 
their significance has been growing in recent 
years, owing to advances in microscopy® and 
continued development of theory for the defect 
structure of grain boundaries’. Disclinations 
(and dislocations) can be visualized as linear 
defects related to the distortion of a cylinder; 
for example, a twist disclination is defined 
by a rotation on an axis perpendicular to a 
cut that runs along the longitudinal axis of 
the cylinder (see Fig. 1 of the paper’). Stress 
concentrations around these linear defects 
allow intracrystalline deformation at stresses 
much below the theoretical stress required to 
break a plane of atomic bonds in the crystal. 
In the present study, the authors took advan- 
tage of the high-resolution crystallographic 
technique of electron backscatter diffraction 
to identify grain-boundary disclinations in 
olivine (Fig. 1). 


SYLVIE DEMOUCHY 


Many questions remain about the efficacy 
of the disclination hypothesis for resolving 
the Von Mises problem in peridotites. For 
example, how does their density and mobil- 
ity evolve during deformation and with 
changes in thermodynamic conditions, and 
how are the dynamics of disclinations influ- 
enced by the anisotropic (direction-depend- 
ent) elastic properties of olivine? How are 
grain-boundary disclinations involved with 
grain-boundary sliding? Can the inclusion of 
disclinations in polycrystalline-deformation 
models help to resolve questions regarding 
the evolution of lattice-preferred orientations 


BIOLOGICAL TECHNIQUES 


in olivine aggregates deformed to high defor- 
mation? The latter problem is crucial for 
the interpretation of the seismic structure 
of Earth’s mantle, which in turn is currently 
our best observational technique to explore 
convective motions inside Earth. Research 
on these topics promises to provide insights 
into the viscosity of the terrestrial planets 
and into the interpretation of a broad range of 
geophysical observations. m 


Greg Hirth is in the Department of Geological 
Sciences, Brown University, Providence, 
Rhode Island 02912, USA. 


Wrapped around 


the heart 


An elastic membrane cast around a three-dimensional printed model of a specific 
heart allows diverse aspects of cardiac function to be monitored and modified, 
and paves the way to new diagnostic and therapeutic approaches. 


COLLEEN E. CLANCY & YANG K. XIANG 


long-sought goal has been to observe, 

diagnose and treat dangerous disturb- 

ances to cardiac rhythm, an unfortu- 
nately common situation that occurs when 
electrical impulses in the heart become 
disordered. The major roadblock in the quest 
to understand and treat abnormal cardiac 
rhythms (arrhythmias) is that there is no 
straightforward approach for detailed inspec- 
tion of the underlying electrical, metabolic 
and mechanical changes and the interactions 
between them. Writing in Nature Commu- 
nications, Xu et al.' have begun to address 
this deficit by developing three-dimensional 
membranes that uniquely conform to the heart 
surface, providing a platform for flexible arrays 
of multifunctional sensors. 

The devices, termed three-dimensional 
multifunctional integumentary membranes 
(3D-MIMs), offer several key features. They 
are inherently elastic and mechanically stable, 
and they provide a uniform, non-invasive 
interface to all points on the heart, allowing 
high-resolution measurement or high-den- 
sity stimulation of cardiac function (Fig. 1). 
Furthermore, the components of the attached 
sensors can be distinctively configured to allow 
a variety of functions, including electrical, 
thermal and optical stimulation, or measure- 
ment of pH, temperature, voltage or strain. 

The 3D-MIMs are made by casting a thin 
layer of silicone elastomer (a silicone-contain- 
ing, rubber-like material) over a 3D-printed 
reconstruction of the heart of interest, which 
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Figure 1 | Multifunctional monitoring. Xu et al.' 
have developed three-dimensional elastic 
membranes (3D-MIMs) that conform to the 
surface of a specific heart, creating a platform for 
arrays of multifunctional sensors and electronic 
and optoelectronic components that can stimulate 
and measure the pH, temperature, voltage or 
strain in the heart. These devices could be used 

in research applications and in the clinic for 
diagnosis, monitoring and treatment. 
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is generated using scanning techniques 
involving optical segmentation, magnetic 
resonance imaging or computed tomography. 
This process produces an artificial envelope 
that uniquely conforms to a specific heart. The 
manufacturing technique can be applied to dif- 
ferent species and, importantly, to individual 
patients. 

One of the immediate research applica- 
tions of 3D-MIMs will be to perform multi- 
ple simultaneous measurements of a working 
(beating) heart during normal or abnormal 
cardiac cycles. Such measurements include 
temperature, tension and pH, any or all of 
which can be recorded simultaneously from 
electrical signals. Currently, optical mapping 
with voltage-sensitive dyes is the best available 
technique for measuring electrical function 
with high spatial resolution in the intact heart. 
However, one of the acknowledged short- 
comings of this approach is that it relies on 
excitation—contraction uncoupling drugs that 
allow recording only in the motionless heart. 
3D-MIMs overcome this limitation, allowing 
the first possibility of high-resolution meas- 
urements of excitable and metabolic states in 
the beating heart. 

Different actuators for electrical, thermal 
and optical stimulation can also be embedded 
in 3D-MIMs. This would allow the potential 
integration of optogenetic tools” (involv- 
ing light-responsive proteins) or genetically 
coded biosensors’ for monitoring electrical 
and biochemical signals, or to locally activate 
or deactivate genes or proteins in normal or 
diseased hearts. Furthermore, such measure- 
ments or interventions could be applied to 
hearts in the resting state or when they are 
subject to various stressors. 

In the clinical realm, 3D-MIMs have almost 
limitless potential for the construction of 
patient-specific devices for diagnostic and ther- 
apeutic purposes. During open-heart surgery, 
3D-MIMs with various combinations of com- 
ponents will be useful for diagnosing the region 
ofan abnormality, such as arrhythmias, ischae- 
mia or heart failure, which will help to deter- 
mine appropriate surgical or drug treatment 
strategies. The membranes might even be used 
to deliver therapeutic interventions during 
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surgery, including localized tissue ablation, 
pacing or acute low-energy defibrillation’. 
Although the immediate potential for 
using 3D-MIMs in such open-chest surgi- 
cal scenarios is clear, chronic implantation of 
3D-MIMs in patients could prove especially 
effective. Such implantation might eventually 
allow for post-surgery monitoring involving 
continuous measurements of local metabolic, 
excitable, ionic, contractile and/or thermal 
states of the heart in response to various 
insults, diseases or therapies. These diagnos- 
tic data might then be used to activate the 
device remotely to deliver targeted electrical 
therapy without the need for further surgery. 
The membranes might also be developed as a 
platform for low-energy control or defibrilla- 
tion methods to regulate electrical turbulence 


in the heart*. In the same vein, 3D-MIMs could 
conceivably be used for localized and targeted 
delivery of stem cells, viral vectors or drugs. 
There is also the future possibility of creat- 
ing an in vivo optical-mapping system using 
externally applied dyes or internal fluorescent 
indicators. This application would require 
the development of diverse light sources that 
could be integrated into the membranes and 
provide enough power to illuminate both dyes 
and sensors locally or in the whole heart. It also 
remains to be seen whether 3D-MIMs can be 
effectively integrated with other developing 
technologies, such as stem-cell bio-patches, 
or encapsulated viral vectors or drugs. Other 
challenges to the application of these mem- 
branes for chronic implantation include power 
supply, control, durability and encapsulation. 


How to switch affinity 


The protein NRT1.1 transports nitrate ions into plants over a wide range of 
concentrations. Two studies provide structural insight into this unusual 
behaviour, but give different explanations for it. SEE ARTICLES P.68 & P.73 


YI-FANG TSAY 


o acquire nutrients from their 
surroundings, cells produce proteins 
that act as transporters, channels and 
pumps, creating passages in the plasma mem- 
brane through which molecules can pass. 
Unlike channels, pumps and transporters 
must undergo conformational changes to 
transfer substrates across the membrane. In 
plants, the transporter NRT1.1 (also known 
as CHL1 or NPF6.3) is essential for cellular 
uptake of nitrate. Unusually for a transporter, 
NRT1.1 regulates ion uptake by changing 
its affinity for nitrate ions depending on the 
availability of nitrate in the soil. However, 
the structural basis of this behaviour is not 
understood. Two articles’” in this issue pro- 
vide possible answers, by finally determining 
structures of NRT1.1 two decades after its 
identification’, 

Like enzymes, transporters exhibit an affinity 
for specific substrates, and their behaviour can 
be described by the Michaelis constant K,,, 
which determines the range of extracellular 
substrate concentrations at which a trans- 
porter can operate (a low K,,, indicates that 
the transporter has a high affinity for a given 
substrate, and a high K,, indicates low affin- 
ity). The amount of nutrient available to a cell 
can fluctuate considerably, so, to ensure that 
nutrients can be acquired over a wide range 
of concentrations, cells have high- and low- 
affinity transport systems. Typically, the two 
are mediated by distinct transporters; however, 
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NRT1.1 is a dual-affinity transporter, which 
can switch affinity by phosphorylation and 
dephosphorylation at a threonine amino-acid 
residue (designated Thr 101)’. 

Regulation of a transporter by phosphoryl- 
ation can allow cells to rapidly respond to 
nutrient fluctuation. At low nitrate concen- 
trations, NRT1.1 is phosphorylated at Thr 101, 
and functions as a high-affinity nitrate trans- 
porter, able to acquire scant nitrate from the 
soil. At high nitrate concentrations, NRT 1.1 is 
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However, although the technology is in its 
infancy, the development of a 3D-MIM 
prototype represents a breakthrough technol- 
ogy with the potential to significantly expand 
diagnosis and treatment options for common 
cardiac-excitability disorders. m 
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not phosphorylated and mediates low-affinity 
uptake of the abundant substrate’ (Fig. 1). But 
how Thr 101 phosphorylation regulates K,, 
is yet to be defined. The two new studies, by 
Sun et al. (page 73) and Parker and Newstead? 
(page 68), provide different, although not 
necessarily conflicting, explanations. 

A transporter has three main conforma- 
tional states: outward-facing (substrate site 
open to the external environment), occluded 
(closed to both sides of the membrane) and 
inward-facing (open to the cell cytoplasm)°. 
Using crystallography, the two groups ana- 
lysed the inward-facing conformation of 
unphosphorylated NRT1.1, and report essen- 
tially identical structures. NRT1.1 is known 
to be comprised of 12 transmembrane helices 
(TMHs), and the researchers observed that — 
as with other members of the major facilitator 
superfamily (MFS) of membrane transporter 
proteins — the TMHs form a cavity open to 
the cytoplasm, within which the substrate 
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Figure 1 | Changing the affinity of the NRT1.1 protein for nitrate. The nitrate transporter NRT1.1, 
which spans the plasma membrane of plant cells, is responsible for the uptake of nitrate from the external 
environment into the cell cytoplasm. a, When nitrate concentration is low, NRT 1.1 is phosphorylated (P) 
at the amino-acid residue Thr 101, giving it a high affinity for nitrate. b, When nitrate is abundant, 

Thr 101 is not phosphorylated, and NRT1.1 acts as a low-affinity transporter’. Sun et al.' and Parker 

and Newstead’ have resolved the structure of unphosphorylated NRT1.1, and find that it forms a dimer. 
Although Sun et al. propose that this dimerization itself alters affinity, Parker and Newstead suggest that 
dephosphorylation decreases structural flexibility, modulating nitrate uptake. 
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can bind. Unusually for an MFS protein, 
NRT1.1 crystallizes as a dimer with its two 
monomers in the same orientation. Thr 101 
is located close to the dimer interface, at the 
cytoplasmic end of one TMH. On the basis of 
data from multiple lines of analysis, Sun et al. 
propose that Thr 101 phosphorylation affects 
the dimerization of NRT1.1, as is seen in some 
transmembrane receptors. But how dimer 
decoupling could regulate the K,, of NRT1.1 
remains undetermined. 

By contrast, Parker and Newstead question 
the physiological significance of NRT1.1 
dimerization, suggesting that it might occur 
only as a consequence of crystallization and not 
in cells. To test the function of Thr 101 phos- 
phorylation, the authors engineered a mutant 
form of NRT1.1 in which Thr 101 was replaced 
by aspartate (forming a Thr101Asp mutant), a 
change that mimics permanent phosphoryla- 
tion. By comparing the behaviour of Thr 101- 
and Thr101Asp-containing proteins, they 
found that the ability of nitrate to bind NRT1.1 
is independent of phosphorylation. 

Parker and Newstead also showed that the 
Thr101Asp-containing protein has a much 
higher transport rate (measured by hydro- 
gen-ion uptake, which is used as a surrogate 
for measuring nitrate uptake) and a lower 
melting temperature (indicative of increased 
structural flexibility) than the wild-type pro- 
tein. This led the authors to propose that phos- 
phorylation increases transport rate as a result 
of enhanced structural flexibility, and that the 
change in K,, is due to this increase in the rate 
of nitrate transport. The seemingly disparate 
conclusions of the two papers can, however, 
be reconciled — dimer decoupling, induced 
by phosphorylation, might increase structural 
flexibility. 

One caveat to Parker and Newstead’s data 
is that it cannot be ascertained with which 
conformation — inward-facing or other con- 
formational states — their biochemical meas- 
urements correlate. It is not known whether 
substrate affinities are identical in different 
conformations in MFS transporters, and so 
the authors cannot prove that phosphorylation 
does not affect the binding affinity of NRT1.1 
for nitrate. To determine this, a measurement 
must be taken in which the protein is fixed in 
the outward-facing conformation, in which K,, 
is defined. 

Both groups’ structures hint that histidine 
residue 356 (His 356) is the nitrate-binding 
site. Sun and colleagues replaced His 356 with 
alanine, and observed that both high- and 
low-affinity nitrate-uptake activities were 
eliminated. This suggests that the His 356 
nitrate-binding site is required for both 
high- and low-affinity transport. However, 
His 356 is not evolutionarily well conserved 
in the nitrate transporters of the NRT1 fam- 
ily, suggesting that other NRT1 transporters 
might use a different binding site for nitrate, 
or that there are multiple possible steps on the 


nitrate-transporting path. Differences between 
substrate-binding sites might explain why 
NRT1 family members transport substrates 
as diverse as nitrate, peptides and hormones’. 

The TMH regions of the independently 
solved structures are almost identical, but there 
are noticeable differences in TMH-connecting 
loops. These loops, especially the large central 
loop connecting TMH6 and TMH7 (which, 
unfortunately, is not well resolved in these 
papers) might be required for an extra function 
of NRT1.1: that of a nitrate receptor®. Sun 
et al. suggest that the TMH loops might con- 
fer receptor activity by providing docking sites 
at which other proteins can interact with the 
transporter. 

The transceptor (a transporter with receptor 
activity) role of NRT 1.1 might not be unique — 
the glucose transporter GLUT2 may function 
as a sugar transceptor to regulate insulin secre- 
tion and mediate absorption of sugar from the 
intestine®, and more transceptors are likely to 
be identified. In conjunction with biochemi- 
cal analyses, structural analyses of NRT1.1 
in other conformations, for example the less- 
stable phosphorylated form or in complex with 
other signalling components, will further our 
understanding of how nitrate transport and 
sensing are coupled and regulated. 

Enhancing the efficiency with which plants 
take up nitrate could potentially increase crop 
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yields. If plants have higher nitrate uptake, 
this would lower the amount of nitrate that is 
washed off into water systems, possibly reduc- 
ing the pollution of aquatic ecosystems. The 
reduced need for nitrogen fertilizer could 
also lower the world’s energy burden, because 
production of nitrogen fertilizer accounts 
for 1% of global energy consumption each 
year. As such, development of techniques to 
manipulate nitrate uptake is an important 
agricultural issue. Resolution of the struc- 
ture of NRT1.1 could now render structure- 
based understanding and manipulation of 
uptake feasible. = 


Yi-Fang Tsay is at the Institute of Molecular 
Biology, Academia Sinica, Taipei 11529, 
Taiwan. 

e-mail: yftsay@gate.sinica.edu.tw 


1. Sun, J. et al. Nature 507, 73-77 (2014). 
2. Parker, J. L.& Newstead, S. 507, 68-72 (2014). 
3. Tsay, Y.-F. Schroeder, J. |., Feldmann, K. A. & 
Crawford, N. M. Cell 72, 705-713 (1993). 
4. Liu, K. H. & Tsay, Y.-F. EMBO J. 22, 1005-1013 
(2003). 
5. Ho, C.H., Lin, S.H., Hu, H. C. & Tsay, Y.-F. Ce// 138, 
1184-1194 (2009). 
. Yan, N. Trends Biochem. Sci. 38, 151-159 (2013). 
. Leran, S. et al. Trends Plant Sci. 19, 5-9 (2014). 
. Leturque, A., Brot-Laroche, E. & Le Gall, M. 
Am. J. Physiol. Endocrinol. Metab. 296, E985-E992 
(2009). 


OND 


This article was published online on 26 February 2014. 


Hardware for a 
quantum network 


An optomechanical device has been designed that converts radio-frequency 
electrical signals into laser light. The system could allow computers to share data 
in a future quantum network based on optical fibres. SEE LETTER P.81 


MIKA A. SILLANPAA & PERTTI J. HAKONEN 


uantum computers are coming. In fact, 

computing technologies that are based 

on the peculiar laws of quantum physics 
are already a reality in some niche applications. 
These machines perform computations by 
manipulating quantum bits (qubits) of infor- 
mation, instead of classical bits. Because qubits 
can be ‘zero’ or ‘one’ as well as both of these 
values at the same time, they can encode more 
information than their classical analogues. 
Therefore, in certain applications, a quantum 
computer could calculate a multitude of par- 
allel tasks much faster than supercomputers 
will ever be able to do. This ‘qubit crunching’ 
potential aside, quantum computers will even- 
tually need to communicate with one another 
through a network. On page 81 of this issue, 
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Bagci et al.' describe how they have created a 
device that brings this goal a step closer. 

The gadget devised by Bagci and colleagues 
uses concepts from an established field of 
physics called optomechanics, which stud- 
ies the interaction between electromagnetic 
radiation and mechanical motion. When a 
photon strikes a mirror, it exerts a feeble kick 
on it, forcing the mirror to move. This effect 
can be boosted by placing the photon inside a 
cavity — an arrangement of two highly reflective 
mirrors that allows the photon to bounce back 
and forth many times between the mirrors. 

The field of cavity optomechanics has made 
dramatic advances in the past few years’. How- 
ever, a challenge has been to make devices that 
convert radio signals into optical ones while 
preserving the fragile quantum information 
that the signals encode™. Irrespective of how 
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qubit platforms in a future quantum com- 
puter will look, they will probably be operated 
using electrical signals of radio or microwave 
frequencies, because known scalable qubit 
designs fall in this frequency range. Opti- 
cal signals can be efficiently transmitted by 
means of optical fibres. Therefore, such radio- 
to-optical converters would allow quantum 
processors to exchange information reliably 
in a network based on optical fibres. And this 
is where Bagci and colleagues’ device comes 
in: the authors demonstrate a system that 
acts as such a converter, and that will be par- 
ticularly useful for qubit platforms made from 
superconductors’. 

Bagci and co-workers’ system consists of a 
silicon nitride membrane that is coated with 
aluminium and so acts like a mirror. Because 
of its metal coating, the membrane also con- 
ducts electricity and thus can ‘talk’ to elec- 
trical signals. Such signals can be stored as 
radio-frequency photons in a cavity formed 

by the membrane 


The neat aspect and a planar metallic 
of Bagciand ciedoiarnat Met 
collencucs’ ing electrical signals 

ti Su that to the cavity, the 
sod ° authors were able to 
I fe ou e ; set the membrane in 
able to workin vibrational motion. 
the quantum Simultaneous irra- 
regime. diation of the device 


with a laser beam 
allowed the vibrations to be converted into an 
optical signal embedded in the laser beam as 
it was reflected from the mirror. 

Conversion between electrical signals and 
laser light is a standard technique in modern 
communications technology; it is achieved 
using devices such as voltage-modulated insu- 
lating crystals, photoemitters and photodetec- 
tors. The neat aspect of Bagci and colleagues’ 
system is that it should be able to work in the 
quantum regime, in which currently known 
techniques are likely to fail. Although the 
authors did not show their device operating in 
this domain, the system's conversion efficiency 
is orders of magnitude better than that of 
current communications technology. Once 
operation in the quantum regime is attained, 
one could imagine using the device to test 
features of quantum mechanics and as a 
component ofa future quantum network. 

The device also compares favourably with 
present-day electronic amplifiers, such as low- 
temperature electromechanical amplifiers’. In 
the authors’ set-up, noisy electrical signals are 
converted into an optical beam that has a low 
noise component. Such low-noise amplifica- 
tion represents an advance over investigations 
of other converters**. Hence, in addition to 
its potential use in quantum communication, 
the system may provide an alternative means 
of signal amplification at low-noise level in 
various applications that involve narrow band 
frequencies, such as high-resolution nuclear 


magnetic resonance imaging. But there are 
also challenges ahead, including the need 
to shift from the radio to the higher, micro- 
wave frequency range, which is relevant for 
most potential applications, as well as for 
processing the output optical signal. We 
await further developments in this field with 
great interest. m 
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Prediction is 
worth a shot 


Predicting the long-term evolution of influenza is difficult. But a model incorporating 
the effects of deleterious and beneficial mutations has met the more tangible goal 
of predicting year-to-year frequencies of viral groups. SEE ARTICLE P.57 


KATIA KOELLE & DAVID A. RASMUSSEN 


“prediction is difficult, especially about 

the future”. Perhaps even more so than 
in physics, this statement holds true in biol- 
ogy. Nevertheless, biologists have repeatedly 
attempted to predict how their systems will 
fare in the future. When it comes to systems 
that evolve rapidly in response to changes in 
their environment, prediction becomes even 
harder. Despite this formidable challenge, on 
page 57 of this issue, Luksza and Lassig’ tackle 
the problems of predicting how the influenza 
virus will evolve from one year to the next, and 


Pp hysicist Niels Bohr famously said that 


Previous season’s strains 


how such prediction can improve seasonal-flu 
vaccines. 

Certain aspects of influenza’s evolutionary 
dynamics are, if not predictable, at least highly 
repetitive. Because the virus infects up to 15% 
of the human population each year’, most 
individuals have some degree of immunity. 
However, new strains carrying mutations in 
epitopes (protein regions that are recognized 
by human antibodies) regularly arise. These 
strains initially have a fitness advantage over 
previously dominant strains because they 
can more effectively escape a host’s immune 
response. As a result, they rise in frequency 
and, in doing so, deplete their own supply of 


Next season’s clade frequencies 


Figure 1 | Forecasting changes in influenza-clade frequencies. Luksza and Lassig’s model’ predicts the 
frequencies of viral clades (blue, green and purple) in one season from the frequencies and fitness values 

of individual strains in the previous season. An individual strain’s fitness is inferred by incorporating the 
beneficial effects of mutations in epitope regions of the viral haemagglutinin protein (inset, black dots) and 
the deleterious effects of mutations in non-epitope regions (inset, red dots). (Haemagglutinin image: David 


S. Goodsell/http://doi.org/c3ngkh.) 
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susceptible hosts such that even newer strains 
gain the advantage. This continuous process 
of evolution — referred to as antigenic drift — 
results in rapid turnover of the viral population 
and thus the possibility of an individual becom- 
ing reinfected with flu within a few years. More- 
over, it leads to the need to regularly update the 
composition of seasonal-flu vaccines’. 

Although flu’s antigenic drift is well charac- 
terized, predicting exactly what strains carry- 
ing which antigenic mutations will circulate in 
the future remains problematic. This is largely 
because the stochastic nature of the mutational 
process itself leads to uncertainty over what 
mutations will arise. Apart from this, even 
predicting the fate of strains currently resid- 
ing in the population is a formidable challenge, 
because multiple strains carrying different 
combinations of mutations co-circulate and 
to some extent compete with one another for 
susceptible hosts. The predictive model that 
Luksza and Lassig present addresses this prob- 
lem by targeting a somewhat more manage- 
able question: can one predict changes in the 
frequencies of groups of viral strains (clades) 
from one year to the next? The answer seems to 
be yes, and with considerable accuracy. 

At its core, Luksza and Lassig’s model 
predicts viral clade frequencies in a given year 
using strain frequencies and fitness values 
from the preceding year (Fig. 1). Its effective- 
ness therefore relies on how accurately the 
model assigns fitness values to strains, a dif- 
ficult task to do well, given that we have little 
understanding of how any individual mutation 
affects fitness. To make this task feasible, the 
authors consider only the fitness effects of two 
classes of mutation — epitope and non-epitope 
mutations — in the haemagglutinin surface 
protein of the virus. 

Although simple, this approach has a clear 
biological rationale. Mutations at epitopes are 
likely to be beneficial to the virus, because 
they alter the structural features targeted by 
host antibodies. Thus, a strain can have higher 
fitness than its competitors by being antigeni- 
cally more distinct from previously circulat- 
ing strains. By contrast, mutations outside 
epitope regions are often deleterious because 
they reduce protein stability or upset evolu- 
tionarily conserved viral functions. By train- 
ing the model on the evolutionary dynamics 
of historical strains, the authors were able to 
estimate the fitness effects of these two classes 
of mutation, and thereby to quantify the fitness 
of currently circulating strains on the basis of 
the mutations they carried. 

Using these estimates, Luksza and Lassig 
projected clade frequencies a year into the 
future, and determined the accuracy of their 
model's predictions by comparing the mag- 
nitude of predicted clade-frequency changes 
with that of the changes observed. They found 
that their model correctly predicted growth 
in viral clades 93% of the time and correctly 
predicted decline 76% of the time. 
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The model's predictive accuracy may be 
very useful to the network of researchers who 
determine which strains to include in the 
seasonal-flu vaccine. Currently, vaccine strains 
are chosen using assays that quantify antigenic 
differences between circulating strains*. This 
approach is highly effective in some years, but 
antigenic mismatches between the vaccine 
strain and the strain that ends up dominat- 
ing the next flu season do occur. Luksza and 
Lassig’s model provides insight into why such 
mismatches might arise: deleterious muta- 
tions in non-epitope regions might suppress 
the most antigenically distinct strains that 
are prime vaccine candidates. The model 
also delivers another way of choosing vaccine 
strains — by including strain cross-immunity 
estimates as well as the inferred fitness and 
frequency of current strains. 

Although Luksza and Lassig’s model pre- 
sents a new perspective on what contributes 
to viral fitness and what aspects of flu evolu- 
tion can feasibly be predicted, it also points 
to necessary future work. First, the model 
assumes a simple relationship between the 
genetic distance between strains and the extent 
of cross-immunity that they induce in a host. 
However, antigenic analysis has shown that 
some amino-acid changes in epitope regions 
have only a slight antigenic effect, whereas 
others have a pronounced one’. Incorporat- 
ing a more empirically informed ‘map’ of the 
associations between viral genotypes and their 
antigenic characteristics might increase the 
model’ predictive power. Second, although the 
authors have already taken the important step 
of broadening predictive models to include 


the effects of non-epitope mutations in the 
haemagglutinin protein, viral fitness also 
surely depends on the virus’s seven other 
gene segments. Incorporating whole-genome 
analysis into the model may therefore further 
improve prediction, especially if mutations 
interact in a non-additive manner (epistati- 
cally) across gene segments. 

Finally, in terms of strain selection for vac- 
cines, it is worth bearing in mind that the 
ultimate goal of vaccination might not be to 
reduce the number of flu infections, but rather 
to minimize the number of flu deaths or the 
overall economic cost of infections®. Luckily, 
modifying the model to incorporate such aims 
seems relatively straightforward, provided that 
sufficient data are available to quantify how 
viral strains differ in virulence or other relevant 
properties. Thus, although further work is 
needed, it is clear that Luksza and Lassig have 
significantly advanced the difficult task of flu 
prediction, especially about the future. m 
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Inflammation lights 
the way to metastasis 


Tumour spread is the main cause of death in patients with melanoma. Exposure of 
melanoma to ultraviolet radiation has now been found to cause an inflammatory 
response that drives the formation of distant metastases. SEE LETTER P.109 


SETH B. COFFELT & KARIN E. DE VISSER 


elanoma is the deadliest form of 
Me cancer. The main risk factor 

for its development is ultraviolet 
(UV) radiation from the sun, which directly 
induces alterations in the DNA of melanocytes, 
the skin’s pigment-producing cells’. Once a 
melanoma has escaped from its primary loca- 
tion to distant organs and formed metastases, 
there is only a limited chance that the cancer 
can be controlled. But what causes melanoma 
to spread? In this issue, Bald et al.* (page 109) 
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have uncovered another surprising role of UV 
radiation in this cancer type — it promotes 
metastasis through the activation ofan inflam- 
matory response. 

Besides inducing mutations in melanocytes, 
exposure to UV radiation disrupts the epithe- 
lial cells (keratinocytes) that form the skin’s 
outer layer, and also causes inflammation. 
But whether these other damaging effects of 
the radiation influence melanoma progres- 
sion was unknown until now. Using a trans- 
genic mouse model that closely mimics the 
human disease, Bald and co-workers observed 
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Figure 1 | Exposure to ultraviolet radiation drives metastasis. UV radiation directly induces DNA 
alterations that lead to melanomas. It also initiates other processes in the skin, including inflammation 
and damage to keratinocytes in the outer skin layer. Bald et al.” show that keratinocyte secretion of 
HMGBI — a protein released by dying or stressed cells — initiates an influx of neutrophils. These cells 
produce inflammatory molecules, such as TNE, which stimulate the formation of new blood vessels and 
induce the migration of melanoma cells along these vessels, thereby facilitating the cancer’s spread to 


distant organs. 


that repetitive UV irradiation of developing 
tumours increased the formation of metastases 
in the lung, without affecting the growth of the 
primary melanoma. 

Two observations led the authors to hypoth- 
esize that non-cancer cells were involved in 
driving metastasis. First, they noted that irra- 
diation of melanomas caused a striking influx 
of neutrophils — white blood cells that are key 
players in immune defence and inflamma- 
tory disorders. Second, irradiation induced 
the migration of melanoma cells towards 
blood vessels and their subsequent move- 
ment along the vessels’ surfaces. The authors 
saw that, near the tumours, the neutrophils 
were attracted to UV-damaged keratinocytes 
by a molecule called HMGB1, a protein nor- 
mally found in the nucleus of healthy cells but 
which is released by dying or stressed cells. 
Depletion of neutrophils or inhibition of their 
HMGB1-dependent recruitment reduced the 
migration of melanoma cells along blood ves- 
sels and abrogated the UV-radiation-induced 
lung metastasis. These data indicate that 
the neutrophil-rich inflammatory response 
caused the spread of melanoma cells to distant 
organs (Fig. 1). 

Through a series of in vitro experiments, the 
authors demonstrated that activated neutro- 
phils secrete the pro-inflammatory protein 
TNE, which stimulates new blood vessel 
formation and the migration of melanoma 
cells. They also studied tumour tissues from 
178 patients with melanoma and found, in line 
with their experimental data, that ulceration 


of melanomas (breakdown of the skin layer 
above the cancer) and neutrophil influx were 
associated with the presence of melanoma cells 
adjacent to blood vessels and increased 
incidence of metastatic disease. 

Thus, Bald etal. have uncovered a mechanism 
by which UV radiation triggers the activa- 
tion of non-cancer cells in the vicinity of a 
melanoma. These cells initiate a harmful 
inflammatory cascade that leads to increased 
interaction between the cancer cells and blood 
vessels, culminating in the spread of the can- 
cer cells to distant organs. The data add to the 
emerging realization that metastasis is not 
just a process that is intrinsic to cancer cells, 
but rather is one that depends on complex 
and reciprocal interactions with non-cancer 
cells in the tumour microenvironment’. As 
such, targeting UV-activated, pro-meta- 
static neutrophils, either by preventing their 
activation and accumulation or by inhibit- 
ing their downstream effects, represents a 
potential therapeutic opportunity to interfere 
with metastatic melanoma. Moreover, block- 
ing neutrophil function may be relevant to 
other cancer types, because there is evidence 
that neutrophils can promote metastasis in 
other tumour models*”. 

At present, it is unclear how Bald and 
colleagues’ findings could be translated into 
preventive measures. Many individuals who 
have precancerous lesions are unaware of 
them, and continue to expose themselves to 
the sun. Furthermore, it is still uncertain how 
much UV-radiation exposure is required to 
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induce the localized cellular changes that lead 
to metastasis in humans. As an alternative to 
preventing the harmful effects of UV radiation 
directly (through minimizing exposure), 
inhibiting its downstream effector mecha- 
nisms could also be an approach to curbing 
metastatic melanoma. Therefore, it will be 
useful to determine whether TNF is the sole 
neutrophil-derived mediator driving UV- 
induced metastasis, or whether other factors 
are important as well. Further characteriza- 
tion of pro-metastatic neutrophils, for exam- 
ple by gene expression and functional assays, 
might provide greater insight into their activity. 
Similarly, the potential pro-metastatic effects of 
other cancer-cell mutations that occur through 
repeated exposure to UV radiation should also 
be determined. Such analyses will help to better 
define UV-triggered melanoma metastasis and 
may identify new targets for this deadly disease. 

Previous work has shown that UV radiation 
not only induces melanoma-initiating muta- 
tions in melanocytes’, but also causes a distinct 
mutational signature resulting in the formation 
of altered cellular proteins (antigens) that 
are recognized as foreign by the T cells of 
the immune system®’. This abundance of 
new, UV-radiation-induced antigens largely 
explains the recent successes seen in patients 
with advanced melanoma following cancer 
immunotherapy*”, a treatment aimed at har- 
nessing the patient’s own immune system to 
attack tumours. Bald and colleagues’ dem- 
onstration that metastasis can be induced 
through the activation of keratinocytes and 
neutrophils is therefore the third known effect 
of UV radiation in melanoma. Their findings, 
together with those of others (reviewed in 
ref. 10), indicate that, in addition to activating 
antitumour immunity, inhibiting pro-tumour 
inflammatory responses is an attractive 
anticancer approach that might increase the 
number of patients with melanoma who are 
successfully treated. m 
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Disclinations provide the missing 
mechanism for deforming olivine-rich 
rocks in the mantle 


Patrick Cordier', Sylvie Demouchy’, Benoit Beausir*, Vincent Taupin?, Fabrice Barou” & Claude Fressengeas? 


Mantle flow involves large strains of polymineral aggregates. The strongly anisotropic plastic response of each individual 
grain in the aggregate results from the interactions between neighbouring grains and the continuity of material 
displacement across the grain boundaries. Orthorhombic olivine, which is the dominant mineral phase of the Earth’s 
upper mantle, does not exhibit enough slip systems to accommodate a general deformation state by intracrystalline slip 
without inducing damage. Here we show that a more general description of the deformation process that includes the 
motion of rotational defects referred to as disclinations can solve the olivine deformation paradox. We use high-resolution 
electron backscattering diffraction (EBSD) maps of deformed olivine aggregates to resolve the disclinations. The dis- 
clinations are found to decorate grain boundaries in olivine samples deformed experimentally and in nature. We present a 
disclination-based model of a high-angle tilt boundary in olivine, which demonstrates that an applied shear induces 
grain-boundary migration through disclination motion. This new approach clarifies grain-boundary-mediated plasticity 
in polycrystalline aggregates. By providing the missing mechanism for describing plastic flow in olivine, this work will 
permit multiscale modelling of the rheology of the upper mantle, from the atomic scale to the scale of the flow. 


There is a large body of work describing plastic deformation of olivine’, 
which is by far the most abundant (~60-70%) and the weakest upper- 
mantle mineral under a wide range of thermo-mechanical conditions. 
In this literature, the prevailing deformation mechanisms include the 
transport of matter by diffusion and of shear by motion of dislocations. 
However, the dislocation-based crystal plasticity of olivine is challenged 
by a lack of slip systems. For plastic flow to occur homogeneously by 
dislocation glide alone, at least five independent slip systems must 
operate, according to the Von Mises criterion’. This requirement can 
be relaxed to four slip systems if inhomogeneous flow is allowed*. In 
orthorhombic olivine (space group Pbnm), plastic slip is restricted to 
[100] and [001] directions with no possibility of shear along [010]. 
Hence, only four slip systems are available: [100](010), [100](001), 
[001](010) and [001](100). In this case, the only non-vanishing com- 


ponents of the corresponding Schmid tensors (the number of overbars 
(100}(010) _ (010) 


indicates the rank of the tensor) ™ are: mj, mi =0.5; 
mliool(oor) 2 milo” (001) 9 5, m{wr(on0) = mlso(ano) =i sani mitorican0) 
=0.5. Thus, the slip systems [100](001) and [001](100) 


are not linearly independent, and only the three-system sets ([100] 
(010), [100](001) and [001](010)) and ([100](010), [001](010) and 
[001](100)) are. Therefore, olivine aggregates do not fulfil Hutchinson’s 
relaxed condition, and arbitrary deformation can only be accommo- 
dated if additional degrees of freedom are provided, by climb (that is, 
dislocation motion out of the glide plane by absorption/emission of 
point defects) for instance. However, if climb is able to help dislocations 
to overcome obstacles, the climb rate is not efficient enough in olivine 
to provide a significant contribution to the strain (see Supplementary 
Information). Hirth and Kohlstedt* have proposed that grain-boundary 
sliding accommodated by (that is, rate limited by) dislocation motion 
could provide an alternative mechanism and account for a creep regime 


observed for olivine in which strain rate is nonlinear in stress and 
sensitive to grain size®. However, grain-boundary sliding requires dis- 
placement discontinuity’ and/or rotational discontinuity at grain bound- 
aries, which is likely to induce damage. More recently, Detrez et al.* 
have explored theoretically the possibility that the lack of slip systems 
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Figure 1 | Volterra’s distortions. a, Reference cylinder with defect line &) and 
cut surface S. b, c, Edge dislocations with Burgers vector b. d, Screw dislocation 
with Burgers vector b. e, f, Twist disclinations with Frank vector ©. g, Wedge 
disclination with Frank vector @. 
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Table 1 | The compatible and incompatible elasto-static defect theory 


Translational Rotational 
Displacement vector U Rotation vector — & 
O=5 curl U 
Strain tensor f= 7 (grad u+grad' z) Curvature tensor K=grad @ 
Cauchy stress tensor = é oe Couple-stress tensor Wea rad 
o= = 
Equilibrium condition for stresses dive=0 Equilibrium condition for couple stresses divM=0 


Nye’s dislocation density tensor %=curl Ue = —curl Up Diselinat oni densiey tensor O=curl®e = —curl Kp 


The left column displays the features of translational elasticity and plasticity; the right column shows the rotational counterparts. The derivation of these elements is provided in the Supplementary Information. 


in olivine could be overcome by purely diffusive (that is, linear) mechan- _ tensor % (see Supplementary Information for a summary of the elasto- 
isms operating at grain boundaries. They showed, however, that sucha __ plastic field theory of crystal defects). Similarly, the rotational incom- 


mechanism was unable to sustain a nonlinear rheology. patibility of the crystal lattice can be related to defects called disclinations. 
ee as ; ; Dislocations and disclinations were both proposed by Volterra’ to 
Disclinations in solids account for the discontinuity of elastic displacements and rotations 


Here we show that the lack of slip systems in olivine can be offset bya along surfaces in a solid containing defects (Fig. 1). In the thought 
more general analysis of plastic deformation in solids. Indeed, plastic experiment proposed by Volterra, a cut is made in a defect-containing 
deformation does not result only from the motion of dislocations. Dis- _ elastic cylinder, from which the core of the line defect, lying along the 
locations are the crystal defects arising from translational lattice incom- _ cylinder axis, has been removed. In the presence of lattice incompat- 
patibility, as measured by the Burgers vector or Nye’s dislocation density __ ibility, the cut induces rigid-body motion of one edge of the cut with 
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Figure 2 | Geometrically necessary dislocation densities in olivine. Densities along a subgrain boundary (red arrow); the [100]-glide, [010]-glide and [001]- 
are given per micrometre. a, Entrywise norm of Nye tensor & in sample glide are plotted with red, blue and green marks, respectively. With [100] and 
PoEM22 (experimentally deformed at 8% in compression at 900°C). b, Close- [001] edge dislocations prevailing, the subgrain boundary is essentially a tilt 
up, showing the «3 component with an opposite curvature on the subgrain boundary. We note the occasional presence of a twist component through sets 
boundary. c, Nye tensor components projected in the crystal reference system _ of screw dislocations at a right angle (1;,%33). 
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Figure 3 | Three maps representing the density of wedge disclinations 43 in 
three deformed olivine aggregates, and the probability of occurrence for 
sample T0548. The density is given in radians per square micrometre. (See 
below and Supplementary Information for a detailed description of the 
samples.) The local Burgers vectors arising from edge dislocations are 
represented by the blue arrows: their horizontal and vertical components are 
respectively «3 and «23 (given per micrometre). a, Sample POEM22 
(experimentally deformed 8% compression at 900 °C). This low-magnification 


respect to the other. When this motion is a translation, the defect is 
referred to as a dislocation, whose strength is the so-called Burgers vector, 
that is, the (space-independent) translation vector. A disclination is 
obtained when the motion is a pure rotation. The strength of the dis- 
clination is the relative rotation vector of the undeformed edges of the 
cut, referred to as the Frank vector. The relative displacement of the 
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map shows the pervasive occurrence of disclinations in grain boundaries. 

b, Sample T0548 (experimentally deformed 250% torsion at 1,200 °C). 

c, Sample OOM, a naturally deformed mylonitic harzburgite from the Oman 
ophiolite (Sumail massif). d, Probability of occurrence of positive (+) versus 
negative (—) wedge disclination density 033 in sample T0548. Filled circles 
indicate all positive wedges, open circles indicate all negative wedges, filled 
triangles indicate positive triple junction wedges and open triangles indicate 
negative triple junction wedges. 


edges in this rotation also gives rise to a space-dependent Burgers vector 
associated with the disclination. As rotational defects, disclinations are set 
into motion by moments of stresses. This motion results in disclination- 
mediated plasticity not accounted for by the dislocation theory. Dis- 
clinations have long been neglected in the field theory of crystal defects, 
including for deep Earth minerals, owing to the very large level of elastic 
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Figure 4 | Disclination-based modelling of the (011)/[100] tilt grain 
boundary with misorientation 60° modelled at the atomic scale in 
olivine”’. a, Initial wedge disclination density 033 (given in radians per square 
metre) used in the simulations. The triangles represent the structural units 


1.85 x 101° 


energy they involve, compared with dislocations, which precludes their 
occurrence as isolated crystalline objects'°. However, self-screened con- 
figurations, such as disclination dipoles, involve relatively small elastic 
energy levels'’”*. It was recently shown that disclination dipoles are 
pervasive along boundaries and subgrain boundaries in severely deformed 
copper, inclusion-free steel, electrodeposited aluminium thin film and 
in recrystallized titanium’’, highlighting the importance of rotational 
defects in metallic polycrystals. 


Disclinations evidenced by EBSD 


To check whether, like metals, deformed olivines contain disclinations, 
we used EBSD, which is a technique well adapted to local lattice ori- 
entation measurements. From the orientation differences A0; between 
neighbouring points separated by Ax; the elastic curvature tensor (see 
Table 1) can be captured: 


Xi 7 Ox; Ax; “) 
In fact, only six components of the elastic curvature tensor can be 
determined, because differences along the direction perpendicular to 
the surface are not available’* (see Methods). The Nye’s dislocation 

density tensor can then be approximately derived as 
ik = Oik Kam — Ki (2) 


Using this equation and considering the constraints on the measure- 
ments, five dislocation densities can be recovered in the reference frame 


54 | NATURE | VOL 507 | 6 MARCH 2014 


Energy (J m-°) 
0 7.5 x 10° 
aa 


1.24 x 101° 


proposed by ref. 21 (see Methods for the construction of this model). b, Elastic 
dilations and contractions. ¢, Shear stress o1 (given in pascals). d, Elastic 
energy density (given in joules per cubic metre). 


of the laboratory, namely 09, %13, %21, %3 and 433, if the surface lies 
along the (1,2) directions. Knowing the crystal orientation at each 
measurement point, these dislocation densities can be rotated in the 
crystallographic reference frame (they are then referred to as «;,, where 


“¢ indicates ‘crystallographic’). Figure 2a, b shows a scalar dislocation 
measure (that is, the length of the local Burgers vector per unit surface 
resulting from the edge dislocation densities 13, %3) in olivine poly- 
crystals deformed experimentally’*’®. One of the most remarkable 
features of these maps is the occurrence of numerous straight subgrain 
boundaries, especially in the larger grains. These subgrain boundaries 
are commonly observed in olivine grains’””” in naturally deformed 
mantle rocks. Made of geometrically necessary dislocations”, subgrain 
boundaries accommodate intracrystalline misorientations and repres- 
ent a first indication (largely overlooked) of the importance of these 
components in the plastic deformation of olivine. The analysis of the 
components allows us to characterize the dislocation types present in 
the sample. Figure 2c shows such an analysis performed along one of 
the subgrain boundaries (Fig. 2b). The boundary is made of [100] dis- 
locations (a{,,0{,,{;) described in the crystal reference frame and of 
[001] dislocations (a, ,«5, 53). The components corresponding to [010]- 
glide (a5, 05,053) are negligible, in agreement with the known character- 
istics of olivine. Further characterization within the samples considered 
in this study leads to the same conclusion and demonstrates that our 
analysis allows the dislocation content in deformed olivine to be resolved. 
Previously, only decoration (annealing followed by optical microscopy 
or scanning electron microscopy) or transmission electron microscopy 
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were available to characterize dislocation microstructures in olivine. This 
new technique also has another use, in that the variations in space of the 
elastic curvatures tensor yield the disclination densities: 


Oij = eur (3) 


Figure 3 presents the results of this analysis performed on the experi- 
mentally and naturally deformed samples. It shows that the grain bound- 
aries are associated with dislocation densities but also that they contain 
numerous disclinations correlating with the variations in misorientation 
along the grain boundary. Disclinations are also frequently associated 
with triple junctions. A visual inspection suggests that positive and nega- 
tive disclinations are spatially coupled in the form of dipoles. The stat- 
istical analysis of Fig. 3d demonstrates that positive and negative wedge 
disclination densities balance perfectly, not only for the set of all grain 
boundaries but also for the subset composed of triple junctions. The 
present study shows that disclination dipoles and the rotational incom- 
patibility they reveal represent a significant component of the deforma- 
tion field in plastically deformed olivines, which is localized in grain 
boundaries and complements the known contribution of dislocations. 


Disclinations-based grain-boundary migration 

To illustrate the role of disclinations on plasticity, we built a model of a 
tilt boundary in olivine with a periodic wedge disclination array. We 
started from the (011)/[100] tilt grain boundary with misorientation 
60° already modelled at the atomic scale in olivine’. Figure 4 shows 
the disclination model built on this boundary. The wedge disclination 
density spots are located on the vertices of the structural units’. The 
zigzag arrangements of the wedge disclinations result in a very effi- 
cient self-screening quasi-quadrupole configuration’. Indeed, most 
of the energy is located within the structural units identified on the ato- 
mistic model. Our disclination-based model leads to a grain-boundary 
energy of 1.3Jm ~, in excellent agreement with the value proposed by 
Adjaoud et al.' based on atomistic calculations of 1.28 J m~. When a 
shear stress is applied to this disclination distribution, the strong var- 
iations of the local shear strain within the defect-containing areas generate 
couple stresses. In response to this couple-stress field, the disclination 
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Figure 5 | Shear-coupled boundary 
migration of the (011)/[100] tilt 
grain boundary of Fig. 4. The wedge 
disclination density 633 (in radians 
per square metre) is colour-coded. A 
to C shows the downward migration 
43 (mr) of the disclinations from 0 ms to 
4.088 x 109 170 ms, when a positive shear 

=10° strain of 0.2 is applied. Black and 

Z white contours show the edge 

= dislocation density «3 (per metre). 
D shows the unloaded bicrystal after 
170 ms, revealing shear-coupled 
migration. Black and white contours 
show the positive plastic shear strain 
ei, produced by the downward 
migration of the grain boundary. For 
this particular misorientation, the 
shear coupling factor f/ = —0.8 
resulting from a downward 
migration coupled with positive 
shear is negative, but it may be 
positive for different misorientations. 
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dipoles are set into motion normal to the boundary (Fig. 5), producing 
plastic shear parallel to the boundary. The disclination dipole structure 
is maintained during this motion, which induces both migration of the 
boundary and shear of the crystal. Like other grain-boundary deformation 
mechanisms, grain-boundary migration/disclination motion is depend- 
ent on grain size. It is more apparent when the proportion of matter 
involved in grain boundaries is higher, that is, at small grain sizes. 

We note that this disclination-induced grain-boundary migration 
mechanism can exhibit linearity, or nonlinearity in stress, depending 
on the assumed relationship between disclination velocity and the 
associated driving force. A more comprehensive study of the disclina- 
tion structure, energy and mobility of grain boundaries as functions of 
misorientations is clearly needed to reach an eventual description of 
the rheology of an olivine aggregate. EBSD-based disclination imaging 
in naturally deformed olivine aggregates will allow these models to be 
established. Our model might also explain the formation of crystal- 
preferred orientations in materials deformed in the Newtonian creep 
regime”, as recently documented for olivine”. 


METHODS SUMMARY 


We used high-resolution EBSD to analyse the microstructure of four aggregates. 
These were a mylonitic harzburgite (OOM) from the Oman ophiolite (Sumail 
massif) and three olivine aggregates: a hot-pressed undeformed sample (PI-1619; 
ref. 15), a deformed sample in compression at 300 MPa and 900 °C (PoEM22) and 
a deformed sample under torsion at 300 MPa and 1,200 °C (T0548). EBSD analyses 
were conducted with a CamScan X500FE CrystalProbe. The operating conditions 
were 15kV and 2.5-nA current under low-vacuum conditions (4-5 Pa of gaseous 
nitrogen) with a 20-mm working distance. The step size was 0.15 1m or 0.2 um. 
The data were acquired and treated with CHANNELS software (http://caf.ua.edu/ 
wp-content/uploads/docs/JEOL-7000F-Oxford_Channel_5_User_Manual.pdf). 

The elastic curvature tensor K, (where ‘e’ indicates ‘elastic’) was recovered from 
the EBSD orientation maps. By additionally recovering the elastic strain field 2 
and building the curl of this field curl Z, the dislocation density tensor % can be 
inferred exactly. In this work, as in most studies, the curl term is overlooked, 
and @ is approximated by 


%=Tr(we)I—Ke (4) 
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In addition, the disclination density tensor 0 may be recovered by building the 
curl of K.. When a single planar orientation map is known, only five components 


of % and three components of 0 can be determined. If the planar directions are 
labelled (1, 2) in the sample frame, the available components are (012,013,021, 
013,033) and (013,023,033). To predict the grain-boundary structure and mobility, 
we used a theory of crystal defect fields (disclinations and dislocations) defined at 
the interatomic scale**’°. Our modelling paradigm is to account for the lattice 
incompatibility arising from crystal defects by focusing on the defect densities, 
rather than on the atoms themselves. The theory has the standard mathematical 
structure of a set of partial differential equations with boundary conditions. The 
unknown fields are the tensorial defect densities (dislocations and disclinations) 
and displacement vector fields, with standard boundary conditions on displace- 
ments and traction/moment vectors. As a consequence of this boundary value struc- 
ture, approximate solutions can be generated by using finite-element methods”. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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A predictive fitness model for influenza 


Marta Luksza!? & Michael Lissig! 


The seasonal human influenza A/H3N2 virus undergoes rapid evolution, which produces significant year-to-year 
sequence turnover in the population of circulating strains. Adaptive mutations respond to human immune challenge 
and occur primarily in antigenic epitopes, the antibody-binding domains of the viral surface protein haemagglutinin. 
Here we develop a fitness model for haemagglutinin that predicts the evolution of the viral population from one year to 
the next. Two factors are shown to determine the fitness of a strain: adaptive epitope changes and deleterious mutations 
outside the epitopes. We infer both fitness components for the strains circulating in a given year, using population- genetic 
data of all previous strains. From fitness and frequency of each strain, we predict the frequency of its descendent strains in 
the following year. This fitness model maps the adaptive history of influenza A and suggests a principled method for 
vaccine selection. Our results call for amore comprehensive epidemiology of influenza and other fast-evolving pathogens 
that integrates antigenic phenotypes with other viral functions coupled by genetic linkage. 


The evolution of influenza A/H3N2 is well documented by sequence 
data of several thousand strains since 1968'. Most of these data con- 
tain the gene sequence of haemagglutinin (HA), which covers one of 
eight segments of the influenza genome and is the primary locus of 
interaction with the human immune system’. Consistent with this func- 
tional role, antigenic changes in the HA epitopes carry the adaptive 
evolution of the pathogen’. 

Evolutionary analysis has a particular role for influenza: it serves 
not only to reconstruct the dynamical process and its causes, but to 
predict future changes**. Any prediction of evolution is essentially an 
estimate of fitness differences between strains. It is these differences 
that lead to deterministic changes in population frequency, which are 
predictable if we know how fitness depends on genotype and host 
environment. Predictability is limited by stochastic events, which range 
from mutations in individual viral sequences to sampling in host-to- 
host transmission. Predictions of influenza HA evolution can inform 
vaccine selection if, despite this limitation, they are sufficiently accurate 
from one year to the next. Currently, the selection of vaccine strains is 
based primarily on haemagglutination inhibition assays, which are used 
to map antigenic changes between viral strains’’. But the fitness of a 
strain is a complex phenotype, which integrates antigenic properties 
with multiple other molecular functions, one of which is simply the 
thermodynamic stability of proteins'*"*. Because there is no recombina- 
tion, the evolution of these functions is strongly coupled, at least within 
each genomic segment””® (whereas genetic linkage between segments 
is reduced by reassortment"*). Here we show that this coupled dynamics 
can be captured by a fitness model that predicts the evolution of influ- 
enza from genomic data. 


Clades as units of prediction 


Our analysis is based on a sample of 3,944 unique HA coding sequences 
obtained from influenza A/H3N2 isolates between 1968 and 2012 (ref. 1), 
partitioned into half-year seasons (Methods). The HA sequences of a 
given season differ from each other by several epitope and non-epitope 
nonsynonymous point mutations. To quantify this diversity, we can 
estimate the population frequencies of mutant alleles at individual RNA 
sites, of combinations of mutant alleles at two or more sites, and of indi- 
vidual strains. From an epidemiological point of view, the frequency of 
a strain is simply the fraction of the infected host individuals corres- 
ponding to that strain’®. We infer the genealogy of these strains by an 


ensemble of trees; see Methods for details of frequency estimation and 
tree reconstruction. We can then trace the evolution of strain lineages 
or clades, which are defined as sets of strains descending from a com- 
mon ancestor (Fig. 1). Whereas strains are typically observed only ina 
single season, clades have an evolutionary history that extends up to 
about 5 years and ends with fixation or loss*. Clades destined for fixa- 
tion originate on the so-called trunk of the tree; all other clades are destined 


} Last common 


ancestor 


Winter 2001-02 
Summer 2002 
Winter 2002-03 


Marked clade: 


Winter 2001-02 
Summer 2002 
Winter 2002-03 


Figure 1 | Evolution of influenza clades. The figure shows a partial influenza 
strain tree, which is based on strains observed in years 2002 and 2003 (bullets 
and circles). Each strain i has a frequency x; in its season’s strain population. 
Our units of prediction are clades, which are defined as sets of strains 
descending from recent last common ancestors. For one of these clades, we 
mark its strain content in winter seasons t = 2002 and t + 1 (light-colour 
bullets) and its last common ancestor (blue diamond). Each clade is linked by 
a set of mutations to the last common ancestor of all strains in year t (black 
diamond); codon position and target amino acid of these mutations are 
indicated for the marked clade. A clade v observed in season t has a frequency 
X,(t), which is the sum of the frequencies of its strains in season t. The marked 
clade grows substantially from X,(t) = 0.08 to X,(t+ 1) = 0.86. 
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for loss (Fig. 1). The evolution of these clades is what we want to predict 
from one year to the next. A successful clade diversifies from its ancestor 
strain through subsequent mutations during its expansion in the popu- 
lation. At the same time, the same mutation often originates indepen- 
dently in different clades. It is specific combinations of mutations that 
distinguish each clade from the other coexisting clades. We make pre- 
dictions for these clades by averaging over the ensemble of equiprobable 
trees, which minimizes the effects of tree reconstruction ambiguities* 
(Methods). 

Our prediction is based on frequency and fitness data that depend 
only on information actually available at a given point in time. Consider 
aclade v containing a set of strains i with frequencies x; ina given season 
t. The observed frequency of that clade in season t, which is denoted as 
X(t), is simply the sum of these strain frequencies, X,(t) = }7j.y) Xi- 
This sum is defined as an average over strain trees, as detailed in Methods. 
Each strain has a Malthusian fitness or growth rate f; (measured in 
units of 1/year), which is to be specified by our model. Given these 
initial data, we predict the frequency of that clade in the season 1 year 
later, 


v(t+1)= 


» x;exp(fi) (1) 
izv,t 

as illustrated in Fig. 1 (for details, see Methods). Equation (1) describes 
the large-scale population dynamics averaged over many transmission 
cycles and over the yearly epidemic cycle. We restrict predictions to 
clades with frequencies X,(t) > 0.15, which are large enough for reli- 
able estimation. These clades are geographically well-mixed'”"* (93% 
of them cover two or more continents), whereas smaller clades are domi- 
nated by sampling noise and geographical bias (94% are observed on 
a single continent only). We can check the quality of our method a 
posteriori by comparing predicted and actual clade evolution, using the 
observed frequencies X,(t+ 1). 


Fitness model 


Our fitness model has two components, which describe the selection 
on epitope and non-epitope HA genotypes, respectively. Epitope changes 
are predominantly under positive selection’""’, because they affect the 
antigenic characteristics of a strain. Antigenic selection is contained 
in multi-strain epidemiological models, which describe a susceptible- 
infected-recovered (SIR) dynamics!” In this type of model, host 
individuals acquire partial immunity against infections with all strains 
of similar antigenic characteristics. Therefore, the strain growth rates fj 
depend on the population history of previously circulating strains. We 
use an SIR model to derive our minimal epitope fitness model (Methods): 
a given strain i incurs a cross-immunity load generated by all previous 
strains j, each of which generates a fitness cost proportional to its fre- 
quency x; and to the cross-immunity amplitude C (a;,a;). These ampli- 
tudes depend on the antigenic similarity of the strains i and j, which is 
encoded in the epitope segments of their HA sequences a; and a;. We 
neglect higher-order antigenic interactions involving more than two 
strains’ and the birth—-death turnover of the host population, which 
can be argued to produce only subleading effects in the epidemiology of 
influenza A/H3N2. 

Non-epitope mutations are predominantly under negative selection’, 
because they affect protein stability and other conserved molecular 
functions'*"*. Here we describe these effects by a simple mutational- 
load model: each strain incurs a fitness cost £(a;) that is the cumulative 
effect of recent non-epitope amino acid changes, which occur in its 
ancestral lineage in the current season (Methods). 

Together we obtain a strain fitness of the form 


fi=fo—L(a)— S> %C(aiaj) (2) 
ft <ti 


with a constant fy ensuring the correct normalization of strain frequen- 
cies (Methods). Importantly, this strain-based model goes beyond a fit- 
ness model for individual mutations: it counts each new beneficial or 
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deleterious mutation together with the previous changes in its ancestral 
lineage. 

The simplest fitness model of this form has uniform selective effects: 
each non-epitope mutation generates a fitness cost oy, and each epi- 
tope mutation reduces the cross-immunity amplitude by an amount 
Gep. However, the biology of cross-immunity and protein stability 
deviates from this model. Both phenotypes are non-uniform and non- 
linear functions of genetic distance*’*"'**; that is, the effect of a muta- 
tion depends on its sequence position, on the amino acids involved, 
and on its background of previous mutations. Our full fitness model 
uses nonlinear cross-immunity amplitudes C(ai.aj), and it includes 
position-specific effects and nonlinear fitness terms that are inferred 
from observed clade histories (Methods; see also a related allele-based 
inference scheme’). Importantly, this model has only four fit para- 
meters, which can be inferred from our data set without compromising 
predictive power. 


Frequency predictions for clades 


The winter-to-winter prediction for the Northern Hemisphere obtained 
from our full, clade-based fitness model is shown in Fig. 2. To deter- 
mine the accuracy of this model, we ay the predicted frequency 
ratio, or Wrightian fitness, W, (t+1 )/X(t) ) with the posterior 
observed ratio W,=X,(t+1)/X,(t) for all clades with frequencies 
X,(t) > 0.15 in a given season. ae oe points (W,, W,) in Fig. 2a 
are distributed around the diagonal of correct prediction; some scatter 
can be explained by statistical errors in frequency estimation due to 
tree reconstruction and sampling (Extended Data Fig. 1). As discussed 
below, these predictions can be improved further by broadening the 
data basis of our model. The direction of frequency evolution is pre- 
dicted with remarkable accuracy. There are 121 clades with observed 
growth (W, > 1), which we predict correctly in 93% of the cases 
(W, > 1). For the 67 clades with observed decline (W, <1), we cor- 
rectly predict decline in 76% of the cases. Importantly, the fitness 
amplitude W, is predicted accurately for the clades destined for fixa- 
tion, which have X,(t+1)~1 and appear close to the diagonal in 
Fig, 2a. 

The fixation of a clade implies the fixation of all mutations that 
appear in its ancestor strain. As shown in Fig. 2b, the yearly numbers 
of nucleotide fixations between 1994 and 2012 are also well predicted 
by our model. This pattern is well known to be clustered (80% of 
the nucleotide fixations occur in a subset of 11 years), which reflects 
recurrent selective sweeps in the evolution of influenza HA*””. 

Figure 2c maps our prediction onto the strain tree. Each clade, repre- 
sented by its ancestor strain, is coloured according to the maximum of 
the predicted frequency changes X,(t +1) — X,(t) over its history. We 
find clade expansion predominantly close to the trunk and decline far 
away from the trunk, which is consistent with the observed shape of the 
influenza tree. 

Tagging clades by their point of origination (Methods), allows us to 
analyse correlations between fitness and geographical location. For clades 
originating in east and southeast Asia, we predict growth (W, > 1) in 

77% of the cases, compared to 54% of the cases for clades originating 
elsewhere; the corresponding fractions with observed growth (W, > 1) 
are 77% and 49%. This is consistent with the particular role of east and 
southeast Asia in seeding antigenic variants, which has been established 
previously’®. Thus, our analysis captures broad spatial patterns in the 
evolution of influenza A/H3N2, although the underlying fitness model 
is geographically neutral (this point will be discussed further below). 

We can quantify the statistical information gain due to our predic- 
tion by comparing distributions of predicted and posterior next-year 
frequencies (Methods). We find the observed frequency evolution to 
be more likely by a factor >10°”° under our fitness model compared to 
a null model with constant frequencies (that is, zero fitness) for all strains. 
We emphasize that our prediction works only from one year to the next, 
because it cannot predict the new mutations that arise after its base year 
and shape the course of evolution over longer periods. 


©2014 Macmillan Publishers Limited. All rights reserved 


a T b 
4b 

= 3f 

S 

8 

2 2 

iS) 

8 

® 

a 
de 

30 20 10 0 

Or 


Predicted substitutions 


Posterior ratio, W 


0.0 1.0, 
MX (t+ 1) 


Ml Synonymous 
Figure 2 | Year-to-year predictions of HA evolution. a, Wrightian fitness: the 
predicted frequency ratio W, = X,(t+1) / X(t) is plotted against the posterior 
ratio W, =X,(t+1)/X,(t) for 188 influenza HA clades with initial frequency 
X,(t) > 0.15 observed since 1993 (error bars due to tree reconstruction and 
sampling are given in Extended Data Fig. 1). The predicted frequency X(t+ 1) 
is indicated by colour; clades destined for fixation are shown in red. Clade 
growth (W, > 1) is correctly predicted in 113 of 121 cases, clade decline in 51 of 


To test our method on a related system, we obtain clade fitness pre- 
dictions for seasonal influenza A/H1N1. This lineage has re-entered 
the human population in 1977 and evolved in a way broadly similar to 
H3N2 until 2009, when the pandemic H1N1 lineage emerged. Compared 
to the H3N2 data set, the H1N1 strain sample’” has larger regional and 
seasonal biases, potentially weaker antigenic selection’, and larger uncer- 
tainty about the exact epitope positions” (Methods). Our predictions 
for H1N1 are comparable to H3N2 but somewhat more noisy, as expected 
from their less informative strain sample (Extended Data Fig. 2). This 
establishes a proof of principle for the applicability of our model to 
other influenza strains. 


Vaccine strain selection 


Our model provides a principled method to select strains for influenza 
vaccines. By equation (2), vaccination based on a strain v reduces the 
fitness of each circulating strain i proportionally to the cross-immunity 
amplitude C(a;,a,). This causes a reduction in the total number of infec- 
tions that is proportional to the average cross-immunity between the 
vaccine strain and the circulating strains in a given season, C,(f) = 
>>. XiC(a;,a,) (Methods). The optimal vaccine maximizes this reduc- 
tion, which defines the cross-immunity centre of mass of the circulating 
strains. Equation (1) predicts next-year cross-immunity amplitudes 
C,(t+1) = 3; x1 exp(f;)C(aj,a,), which can be compared a posteriori 
with the observed amplitudes C,(t+ 1). 

In particular, we can compare the optimal vaccine strains predicted by 
our model and actual vaccine strains used in the Northern Hemisphere” 
to the posterior centre-of-mass strains observed in the following year 
(the established procedure of vaccine strain selection is described in 
Methods). Figure 3 shows this comparison for influenza A/H3N2 in 
the winter seasons from 1994 to 2012. In all years, the model-selected 
vaccine strains have a smaller amino acid distance from the cross- 
immunity centre of mass of the same season than the actual vaccine 
strains (insert of Fig. 3). This can be explained in part by differences 
between our sequence-based cross-immunity measure C(a;,a;) and the 
haemagglutination-inhibition-based antigenic distances currently used 
for vaccine selection. The latter are known to evolve in a more punc- 
tuated way’’, but we observe distance differences even in years when 
vaccine strains have been updated. These results suggest that a fitness- 
model-based prediction of influenza evolution can contribute to vaccine 
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67 cases. b, Yearly numbers of HA nucleotide fixations: predicted numbers 
are compared to posterior numbers. c, Dynamics on the influenza strain 
tree: for each clade originating between 1993 and 2010, the ancestor node 
is coloured according to the maximum of the predicted frequency changes, 
max; [X,(t +1) —X,(t)]. Our model correctly predicts expansion along the 
trunk (thick line) and loss of branches off trunk. 


strain selection; however, we caution against premature conclusions 
before our prediction scheme is carefully tested with haemagglutina- 
tion inhibition data. Our model can also be used to estimate how vac- 
cination affects the course of influenza evolution (Methods). 


Mapping the adaptive process 


The fitness effects underlying our predictions can be displayed in a 
quantitative map of influenza’s adaptive history. As key quantity we 
use the cumulative fitness flux”*”°, which measures the total amount 
of adaptation up to a given clade; this quantity is defined in Methods 
and illustrated in Extended Data Fig. 3. The map of Fig. 4 shows the 
fitness flux for 234 influenza A/H3N2 clades on a tree between 2003 
and 2008 (see Extended Data Fig. 3 for fitness flux over a longer period). 
It displays clades with multiple different values of fitness and fitness flux 
in each year. The evolution of this distribution generates a travelling 
fitness flux wave, which links influenza to recent theoretical models 
of asexual evolution*®**. The advance of the wave is measured by the 
population mean fitness flux, which is shown as a black dashed line in 
Fig. 4. This quantity measures correlations between fitness and actual 
frequency changes of clades. It can be used to compare the predictive 
power of different fitness models. The best epitope-only fitness model 
captures about 63%, and the best model with uniform selective effects 
about 57% of the cumulative fitness flux given by the full model (Extended 
Data Table 1). An information-theoretic comparison of fitness models 
shows the same ranking (Methods, Extended Data Table 1). These results 
indicate that non-epitope changes and nonlinear selective effects have 
an important role in the adaptive process of influenza and its successful 
prediction. 

The underlying mode of evolution is revealed by individual flux 
genealogies shown in Fig. 4. We observe that high-fitness clades seed 
future high-fitness clades by beneficial mutations. In particular, the 
clades on the trunk of the tree have consistently high predicted fitness 
values and are driven to fixation by multiple beneficial mutations during 
their expansion. At the same time, many high-fitness clades are eventually 
driven to loss, because they are overtaken in fitness by other competing 
clades. Individual beneficial alleles are lost if they arise in a low-fitness 
clade or if they are outcompeted by subsequent beneficial mutations in 
disjoint clades. These observations provide direct evidence of clonal 
interference in the evolution of influenza’ with pervasive effects of 
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genetic linkage on individual alleles’®. This mode of evolution is well 
known from laboratory evolution experiments with microbial and viral 
populations****, 

Clonal interference can explain the observed regional fitness differ- 
ences between influenza A/H3N2 clades as an effect of multiple bene- 
ficial mutations coexisting in a population: individual antigenic mutations 
originating in east and southeast Asia have the same average effect as 
mutations originating elsewhere, but they occur in lineages that have 
accumulated more previous beneficial mutations in their recent past. 


Discussion 


We have developed a dynamical model that successfully predicts the 
year-to-year evolution of individual influenza clades, based on epitope 


Fitness flux, ® 


2004 2005 


Year 


2003 2006 2007 2008 


60 | NATURE | VOL 507 | 6 MARCH 2014 


Figure 3 | Vaccine selection. 
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and non-epitope characteristics of their HA gene. Our general model is 
applicable whenever host-pathogen interactions—in particular, anti- 
genic selection—generate continual adaptive evolution of a predominantly 
asexual population. Our results highlight the determinants of predict- 
ive power: we need sufficient information on the genotypic and pheno- 
typic basis of antigenic and mutational-load fitness components, and 
model training requires a sufficiently deep and unbiased strain sample. 
This suggests that predictions can be improved by integrating diverse 
genotypic and phenotypic data, which include free-energy effects of spe- 
cific mutations’, haemagglutination inhibition data’’, the genomics of 
neuraminidase® and the geographical distribution of strains’*. Further- 
more, the prediction scheme can be extended from population fre- 
quencies to absolute growth rates and population numbers, which 
includes the dynamics of yearly incidence rates”. Together, we expect 
an improved understanding of selective effects for specific mutations 
from limited strain data. This is key to evolutionary predictions for other 
influenza variants, including the potentially pandemic avian A/H5N1 
and A/H7N9 lineages. 

In a broader context, our model establishes a direct link between 
population genetics and epidemiology that is to be explored more 
comprehensively in future work. This link is the strain-specific fitness 
function of equation (2), which governs the dynamics of infected host 
individuals in an SIR framework. Strain fitness depends not only on 
antigenic characteristics, but also on other phenotypes encoded in 
genetically linked sequence. We expect that this coupling between anti- 
genic adaptation and conservation of other functions is not limited to 
influenza, but is a generic feature of fast-adapting pathogens. Therefore, 
the epidemiology of such systems should be based on the ensemble of 
phenotypes linked to the adaptive process. 


Figure 4 | Adaptation map of influenza. The fitness flux ©,(t), computed 
from the fitness model (2) and observed frequency changes, is shown for 234 
clades on a tree between 2003 and 2008; see Methods for the definition and 
Extended Data Fig. 3a for an illustration of fitness flux. Top graph: strains 
within these clades are ordered by year and, within each year, by mutational 
distance to the last common ancestor. The mean cumulative fitness flux O(f) is 
shown as dashed line; see also Extended Data Fig. 3b. This map displays a 
travelling fitness flux wave. Bottom graph: the same map is shown with 
nonsynonymous epitope mutations marked by green triangles; these mutations 
are mostly beneficial’””'®. This gives evidence of clonal interference: successful 
clades are driven to fixation by multiple beneficial mutations (large green 
triangles; origination and fixation of one such clade are marked by arrows), 
whereas other beneficial mutations are driven to loss (small green triangles). 
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Beyond pathogens, this work touches upon the fundamental ques- 
tion of how predictable evolution is. Although there is clearly no gen- 
eral answer to this question, our analysis shows under what auspices 
limited predictions may be successful. 


METHODS SUMMARY 


We partition our strain sample into seasons; t = y labels the period from October 
of year y — 1 to April of year y. For the prediction from season t to t + 1, we use 
maximum-likelihood HA sequence trees of all strains up to season ¢; validation is 
based on trees for the full period. Epitope and non-epitope mutations are mapped 
onto the branches of these trees (Extended Data Fig. 4). Clades are defined as sets 
of all descendants of a given HA sequence a,; clade frequencies are estimated by 
averaging over equiprobable trees. 

Our fitness model has three components. The epitope fitness component f;” = 
Sep dS i xj¢(Dep (ai,a;)) is computed from linear amino acid distances D,» 
between pairs of sequences in epitope codons’, using a nonlinear cross-immunity 
amplitude c(D.,) = exp(—D,,/Do). The non-epitope fitness component f° = 
—GneDne (ai,a*) depends on the non-epitope amino acid distance D,,, between 
a; and the last ancestor of strain i in a previous season, a;. We show that these 
recent non-epitope mutations are under substantial negative selection” (Extended 
Data Fig. 4). The full fitness model (equation (2)) exploits an additional feature of 
the population history: synonymous mutations hitchhiking in selective sweeps” 
reinforce the inference of positive selection by a term f™ = 1D3"(v(i),a*(t)) pro- 
portional to the average neutral mutational distance of all strains in clade v(i) from 
the last common ancestor a*(t). The full model takes the form of equation (2), 
fi=f? +f +f! +o, with a constant fo given by the normalization conditions 
Yi: ri Y: ti exp(fi) =1. 

To quantify the quality of clade frequency predictions, we partition the strains 
of each season f into non-overlapping clades v€ K(t) with frequencies Y;= 
(Y, (t) vex): We define the relative information of observed versus predicted 
next-year frequencies, H(Y;+1 [Yi41) = ex) Y\(t+ Llog[Y,(t+ 1)/¥,(t)] : 
which determines the likelihood p that the observed strains are a sample of the 
predicted frequency distributions. We use this likelihood to rank fitness model 
variants, which include alternative epitopes and glycosylation effects, and to infer 
optimal parameters from strain data of 4-8 years before the base year of prediction 
(Extended Data Table 1 and Extended Data Fig. 6). 

We define the cumulative fitness flux for each non-overlapping clade v € K(f), 


t-1 

0()=F() FO) + SD A) FEI $Y (0) 
t'=to veK(t’) 

where F,(t) = log/¥, (t+1)/Y,()] is the fitness of clade v and F(t)= 

Dvexs) Fv(¢) ¥v(¢) is the mean population fitness in year t. The mean cumulative 

flux O(t) = exit y(t —1)Y,(t) is a measure of adaptation satisfying the fit- 

ness flux theorem”. Deviations from this mean reflect fitness differences between 


clades, ©, (t) —®(t) =F,(t) —F(t). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 23 April 2013; accepted 29 January 2014. 
Published online 26 February 2014. 


1. Bao, Y. etal. The Influenza Virus Resource at the National Center for Biotechnology 
Information. J. Virol. 82, 596-601 (2008). 

2. Wiley, D.C., Wilson, |. A. & Skehel, J. J. Structural identification of the antibody- 
binding sites of Hong Kong influenza haemagglutinin and their involvement in 
antigenic variation. Nature 289, 373-378 (1981). 

3. Bush, R. M., Bender, C. A., Subbarao, K., Fox, N. J. & Fitch, W. M. Predicting the 
evolution of human influenza A. Science 286, 1921-1925 (1999). 

4. Plotkin, J. B., Dushoff, J. & Levin, S. A. Hemagglutinin sequence clusters and the 
antigenic evolution of influenza A virus. Proc. Natl Acad. Sci. USA 99, 6263-6268 
(2002). 

5. Koelle, K., Cobey, S., Grenfell, B. & Pascual, M. Epochal evolution shapes the 
phylodynamics of interpandemic influenza A (H3N2) in humans. Science 314, 
1898-1903 (2006). 

6. Wolf, Y.|., Viboud, C., Holmes, E. C., Koonin, E. V. & Lipman, D. J. Long intervals of 
stasis punctuated by bursts of positive selection in the seasonal evolution of 
influenza A virus. Biol. Direct 1, 34 (2006). 

7. Shih, A. C.-C., Hsiao, T.-C., Ho, M.-S. & Li, W.-H. Simultaneous amino acid 
substitutions at antigenic sites drive influenza A hemagglutinin evolution. Proc. 
Natl Acad. Sci. USA 104, 6283-6288 (2007). 

8. Bhatt, S., Holmes, E.C. & Pybus, 0. G. The genomic rate of molecular adaptation of 
the human influenza A virus. Mol. Biol. Evol. 28, 2443-2451 (2011). 


ARTICLE 


9. Strelkowa, N. & Lassig, M. Clonal interference in the evolution of influenza. Genetics 
192, 671-682 (2012). 

10. Illingworth, C. J. R. & Mustonen, V. Components of selection in the evolution of the 
influence virus: linkage effects beat inherent selection. PLoS Pathog. 8, e1003091 
(2012). 

11. Meyer, A. G., Dawson, E. T. & Wilke, C. O. Cross-species comparison of site-specific 
evolutionary-rate variation in influenza haemagglutinin. Phil. Trans. R. Soc. B 368, 
20120334 (2013). 

12. Smith, D. J. et al. Mapping the antigenic and genetic evolution of Influenza virus. 
Science 305, 371-376 (2004). 

13. Bloom, J. D. & Glassman, M. J. Inferring stabilizing mutations from protein 
phylogenies: application to influenza hemagglutinin. PLOS Comput. Biol. 5, 
e1000349 (2009). 

14. Wylie, C. S. & Shakhnovich, E. |. A biophysical protein folding model accounts for 
most mutational fitness effects in viruses. Proc. Natl Acad. Sci. USA 108, 
9916-9921 (2011). 

15. Holmes, E. C. et a/. Whole-genome analysis of human inuenza A virus reveals 
multiple persistent lineages and reassortment among recent H3N2 viruses. PLoS 
Biol. 3, e300 (2005). 

16. Grenfell, B. T. et al. Unifying the epidemiological and evolutionary dynamics of 
pathogens. Science 303, 327-332 (2004). 

17. Rambaut,A. etal. The genomic and epidemiological dynamics of human influenza 
Avirus. Nature 453, 615-619 (2008). 

18. Russell, C. A. et a/. The global circulation of seasonal influenza A (H3N2) viruses. 
Science 320, 340-346 (2008). 

19. Gog, J. R. & Grenfell, B. T. Dynamics and selection of many-strain pathogens. Proc. 
Natl Acad. Sci. USA 99, 17209-17214 (2002). 

20. Tria, F., Lassig, M., Peliti, L. & Franz, S. A minimal stochastic model for influenza 
evolution. J. Stat Mech. PO7008 (2005). 

21. Kryazhimskiy, S., Dieckmann, U., Levin, S.A. & Dushoff, J.On state-space reduction 
in multi-strain pathogen models, with an application to antigenic drift in influenza 
A. PLOS Comput. Biol. 3, e159 (2007). 

22. Minayev, P. & Ferguson, N. Improving the realism of deterministic multi-strain 
models: implications for modelling influenza A. J. R. Soc. Interface 6, 509-518 
(2009). 

23. Rasmussen, D.A., Ratmann, O. & Koelle, K. Inference for nonlinear epidemiological 
models using genealogies and time series. PLOS Comput. Biol. 7, €1002136 
(2011). 

24. Kryazhimskiy, S., Dushoff, J., Bazykin, G. A. & Plotkin, J. B. Prevalence of epistasis in 
the evolution of influenza A surface proteins. PLoS Genet. 7, €1001301 (2011). 

25. Bogner, P. et al. A global initiative on sharing avian flu data. Nature 442, 981 
(2006). 

26. Huang, J.-W., Lin, W.-F. & Yang, J.-M. Antigenic sites of HIN1 influenza virus 

hemagglutinin revealed by natural isolates and inhibition assays. Vaccine 30, 

6327-6337 (2012). 

27. WHO Recommendations for Influenza Vaccine Composition. Retrieved from 

http://www.who.int/influenza/vaccines/vaccinerecommendations1/en. 

28. Mustonen, V. & Lassig, M. From fitness landscapes to seascapes: non-equilibrium 

dynamics of selection and adaptation. Trends Genet. 25, 111-119 (2009). 

29. Mustonen, V. & Lassig, M. Fitness flux and ubiquity of adaptive evolution. Proc. Natl 

Acad. Sci. USA 107, 4248-4253 (2010). 

30. Desai, M. M. & Fisher, D. S. Beneficial mutation selection balance and the effect of 

inkage on positive selection. Genetics 176, 1759-1798 (2007). 

31. Rouzine, |. M., Brunet, E. & Wilke, C. O. The traveling-wave approach to asexual 
evolution: Mullers ratchet and speed of adaptation. Theor. Popul. Biol. 73, 24-46 
(2008). 

32. Schiffels, S., Szollosi, G. J., Mustonen, V. & Lassig, M. Emergent neutrality in 
adaptive asexual evolution. Genetics 189, 1361-1375 (2011). 

33. Good, B.H., Rouzine, |. M., Balick, D. J., Hallatschek, O. & Desai, M. M. Distribution of 
fixed beneficial mutations and the rate of adaptation in asexual populations. Proc. 
Natl Acad. Sci. USA 109, 4950-4955 (2012). 

34. Neher, R. A. & Hallatschek, O. Genealogies of rapidly adapting populations. Proc. 
Natl Acad. Sci. USA 110, 437-442 (2013). 

35. Gerrish, P. J. & Lenski, R. E. The fate of competing beneficial mutations in an 
asexual population. Genetica 102-103, 127-144 (1998). 

36. Miralles, R., Gerrish, P. J., Moya, A. & Elena, S. F. Clonal interference and the 
evolution of RNA viruses. Science 285, 1745-1747 (1999). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We acknowledge discussions with B. D. Greenbaum, B. Grenfell, 
C. Illingworth, A. Levine, J. W. McCauley, V. Mustonen, S. Pompei and R. Rabadan. 
This work has been partially supported by Deutsche Forschungsgemeinschaft 

grant SFB 680 and by German Federal Ministry of Education and Research grant 
0315893-Sybacol. Part of this work was performed at the Kavli Institute of Theoretical 
Physics (Santa Barbara), which has been supported by National Science Foundation 
grant PHY05-51164. 


Author Contributions Both authors designed research, developed methods, analysed 
data, interpreted results and wrote the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to M. Lassig (lassig@thp.uni-koeln.de). 


6 MARCH 2014 | VOL 507 | NATURE | 61 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature13011 


DNA interrogation by the CRISPR 
RNA-guided endonuclease Cas9 


Samuel H. Sternberg'*, Sy Redding’*, Martin Jinek*+, Eric C. Greene* & Jennifer A. Doudna’*>*° 


The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided 
endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9- guide RNA complexes are also 
effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experi- 
ments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and 
cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target 
DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby 
PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA hetero- 
duplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, 
PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify 
potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA. 


RNA-mediated adaptive immune systems in bacteria and archaea rely 
on CRISPRs and CRISPR-associated (Cas) proteins to provide protec- 
tion from invading viruses and plasmids’. Bacteria harbouring CRISPR- 
Cas loci respond to viral and plasmid challenge by integrating short 
fragments of the foreign nucleic acid (protospacers) into the host chro- 
mosomeat one end of the CRISPR array’. Transcription of the CRISPR 
array followed by enzymatic processing yields short CRISPR RNAs 
(crRNAs) that direct Cas protein-mediated cleavage of complementary 
target sequences within invading viral or plasmid DNA*~. In type II 
CRISPR-Cas systems, Cas9 functions as an RNA-guided endonuclease 
that uses a dual-guide RNA consisting of crRNA and trans-activating 
crRNA (tracrRNA) for target recognition and cleavage by a mecha- 
nism involving two nuclease active sites that together generate double- 
stranded DNA (dsDNA) breaks®”. 

RNA-programmed Cas9 has proven to bea versatile tool for genome 
engineering in multiple cell types and organisms* '’. Guided by either a 
natural dual-RNA complex or a chimaeric single-guide RNA®, Cas9 
generates site-specific dsDNA breaks that are repaired either by non- 
homologous end joining or homologous recombination. In addition, 
catalytically inactive Cas9 alone or fused to transcriptional activator or 
repressor domains can be used to alter transcription levels at sites tar- 
geted by guide RNAs”’**. Despite the ease with which this technology 
can be applied, the fundamental mechanism that enables Cas9-RNA 
to locate specific 20-base-pair (bp) DNA targets within the vast sequence 
space of entire genomes remains unknown. 


Single-molecule visualization of Cas9 


We used a single-tethered DNA curtains assay and total internal reflec- 
tion fluorescence microscopy (TIRFM) to visualize the binding site distri- 
bution of single Cas9-RNA molecules on A-DNA substrates (48,502 bp) 
(Fig. 1a)”°*’. We purified Streptococcus pyogenes Cas9 containing a 
carboxy-terminal 3 x -Flag tag that enabled fluorescent labelling using 
anti-Flag-antibody-coated quantum dots (QDs)’’”*, and generated guide 
RNAs (dual crRNA-tracrRNA) bearing complementarity to six different 


20-bp sites within the A-DNA (Fig. 1b and Extended Data Table 1). 
Neither the 3X -Flag tag nor QDs inhibited DNA cleavage by Cas9-RNA, 
and all guide RNAs were functional (Extended Data Fig. 1). Initial experi- 
ments used a nuclease-inactive Cas9 containing D10A and H840A point 
mutations (dCas9) that binds but does not cleave DNA®. QD-tagged 
dCas9-RNA localized almost exclusively to the expected target site (Fig. 1c 
and Supplementary Video 1). Furthermore, Cas9 could be directed to 
any region of the DNA by redesigning the RNA guide sequence (Fig. 1d 
and Extended Data Fig. 2), as anticipated**’. Thus, DNA targeting by 
Cas9-RNA is faithfully recapitulated in the DNA curtains assay. 

We next used apo-Cas9 protein to confirm that the binding observed 
in DNA curtains assays was due to Cas9-RNA and not apo-Cas9 lack- 
ing guide RNA. Notably, apo-Cas9 also bound DNA but exhibited no 
apparent sequence specificity (Fig. le, f). Attempts to measure the dis- 
sociation rate of DNA-bound apo-Cas9 were hampered by their exceed- 
ingly long lifetimes (a lower limit of at least 45 min was calculated; Fig. 1g). 
Biochemical experiments revealed an upper limit of ~25 nM for the 
equilibrium dissociation constant (Kg) of the apo-Cas9-DNA complex, 
compared to ~0.5 nM for the Cas9-RNA complex bound to a bona 
fide target (Extended Data Fig. 3). 

To test whether DNA-bound apo-Cas9 could be distinguished from 
Cas9-RNA, we measured the lifetime of apo-Cas9 on DNA curtains 
before and after injection of ccRNA-tracrRNA or heparin. Apo-Cas9 
rapidly dissociated from nonspecific sites in the presence of either 
competitor (Fig. 1g), and this result was verified with bulk biochemical 
assays (Extended Data Fig. 3). In contrast, target-bound Cas9-RNA 
was unaffected by either heparin or excess crRNA-tracrRNA (Fig. 1g 
and Extended Data Fig. 3). These findings show that nonspecifically 
bound apo-Cas9 has properties distinct from those of Cas9-RNA com- 
plexes bound to cognate targets. 

Initially, we used catalytically inactive dCas9 to avoid DNA cleavage. 
Notably, experiments with wild-type Cas9 also failed to reveal DNA cleav- 
age; rather, Cas9-RNA molecules remained bound to their target sites, 
yielding identical results to those obtained using dCas9-RNA (Fig. 2a). 
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Figure 1 | DNA curtains assay for target binding by Cas9-RNA. 

a, Schematic of a single-tethered DNA curtain”*”’”. b, Wild-type Cas9 or 
dCas9 was programmed with crRNA-tracrRNA targeting one of six sites. 

c, YOYO1-stained DNA (green) bound by QD-tagged dCas9 (magenta) 
programmed with (2 guide RNA. d, dCas9-RNA binding distributions; error 
bars represent 95% confidence intervals obtained through bootstrap analysis”*. 
e, Image of apo-Cas9 bound to DNA curtains. f, Binding distribution of 
apo-Cas9 (n = 467); error bars represent 95% confidence intervals. g, Lifetimes 
of DNA-bound apo-Cas9 (n = 205) and Cas9-RNA (n = 104) after injection of 
10 pg ml" heparin, and of apo-Cas9 (n = 233) after injection of 100 nM 22 
crRNA-tracrRNA. 


We confirmed that the imaging conditions did not inhibit Cas9-RNA 
cleavage activity (Extended Data Fig. 1). These results suggested that 
Cas9-RNA might cleave DNA but remain tightly bound to both cleav- 
age products, which was confirmed with biochemical gel-shift assays 
using 72-bp duplex DNA substrates (Extended Data Fig. 4). To deter- 
mine whether stable product binding would prevent Cas9-RNA from 
performing multiple turnover cleavage, we conducted plasmid DNA 
cleavage assays at varying molar ratios of Cas9-RNA and target DNA 
and measured the rate and yield of product formation. Surprisingly, the 
amount of product rapidly plateaued at a level proportional to the molar 
ratio of Cas9-RNA to DNA, indicating that Cas9-RNA does not follow 
Michaelis-Menten kinetics (Fig. 2b). Turnover also did not occur with 
short duplex DNA substrates and is not stimulated by either elevated 
temperature or an excess of free crRNA-tracrRNA (Extended Data Fig. 5). 

We next used a double-tethered DNA curtain (Fig. 2c)”-*' to con- 
firm that Cas9-RNA catalysed DNA cleavage in single-molecule assays. 
Although Cas9-RNA failed to dissociate from target sites even in the 
presence of 10 pg ml" heparin (Fig. 1g) or up to 0.5 M NaCl (not shown), 
injection of 7M urea caused Cas9-RNA to release the downstream 
end containing the PAM, confirming that the DNA was cleaved at the 
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expected target site (Fig. 2d). These findings show that Cas9-RNA 
remains tightly bound to both ends of the cleaved DNA, thus acting as 
a single-turnover enzyme. 


Cas9-RNA finds targets by three-dimensional diffusion 


To determine how Cas9-RNA locates DNA targets, we visualized the 
target search process using double-tethered DNA curtains. Site-specific 
DNA-binding proteins can locate target sites by three-dimensional colli- 
sions or through facilitated diffusion processes including one-dimensional 
sliding, hopping, and/or intersegmental transfer; these mechanisms 
can be distinguished by single-molecule imaging**’!”’. For these assays, 
Cas9 programmed with (2 guide RNA was injected into the sample 
chamber, buffer flow was terminated, and reactions were visualized in 
real time. These experiments revealed long-lived binding events at the 
target site and transient binding events at other sites on the DNA 
(Fig. 2e, f). We saw no evidence of Cas9-RNA associating with target 
sites by facilitated diffusion (either one-dimensional sliding and/or hop- 
ping); instead, all target association appeared to occur directly through 
three-dimensional collisions (Fig. 2e and Supplementary Video 2). 

The shorter-lived, nonspecific binding events exhibited complex 
dissociation kinetics; the simplest model that describes the data is a 
double-exponential decay with lifetimes of ~3.3 and ~58 s (at 25 mM 
KC)) (Fig. 2f). These lifetimes were readily distinguished from the long 
lifetime of apo-Cas9 (Fig. 2f, inset). Furthermore, the experiments were 
conducted in the presence of a tenfold molar excess of crRNA-tracrRNA 
to exclude contamination from apo-Cas9. This result indicates that at 
least two and possibly more binding intermediates exist on the path- 
way towards cognate target recognition. Although nonspecific DNA 
binding typically involves electrostatic interactions with the phosphate 
backbone such that nonspecific lifetimes tend to decrease with increas- 
ing ionic strength”, the lifetimes of Cas9-RNA bound at nonspecific 
DNA sites were not appreciably affected by salt concentration (Fig. 2f). 
One remarkable implication of this finding is that these Cas9-RNA 
non-target binding events have characteristics more commonly attrib- 
uted to site-specific association*”». 

To gain further insight into the nature of the target search mech- 
anism, we measured the locations and corresponding lifetimes of all 
binding events (Fig. 2g). The off-target binding lifetime distributions 
did not vary substantially at different regions of the DNA. However, 
the number of observed binding events was not uniformly distributed 
along the substrate, indicating that some underlying feature of the A- 
DNA might influence the target search. The A-phage genome contains 
5,677 PAM sites (~1 PAM per 8.5 bp), but it also has an unusual polar 
distribution of (A+T)- and (G+C)-rich sequences”, which leads to an 
asymmetric distribution of PAMs (5'-NGG-3’ for S. pyogenes Cas9) 
(Fig. 2g). Pearson correlation analysis revealed that the Cas9-RNA bind- 
ing site distribution was positively correlated with the PAM distribution 
(r = 0.59, P< 0.05) (Fig. 2h). When we repeated this experiment using a 
guide RNA having no complementary target sites within 1-DNA (spacer 
2 crRNA)®, we saw no change in the binding lifetimes and an even 
stronger correlation with the PAM distribution (Fig. 2h and Extended 
Data Fig. 2b, c). These results, together with the insensitivity of short- 
lived binding events to ionic strength, suggested that Cas9-RNA might 
bind specifically to PAMs and minimize interactions with non-PAM 
DNA while searching for potential targets. 


A PAM is required for DNA interrogation 

To test whether Cas9-RNA uses PAM recognition as an obligate pre- 
cursor to interrogation of flanking DNA for potential guide-RNA 
complementarity, we used competition assays to monitor the rate of 
Cas9-RNA-mediated DNA cleavage (Fig. 3a, b). From these data we 
extracted the average amount of time that Cas9-RNA spends sam- 
pling each competitor DNA before locating and cleaving a radiolabelled 
substrate (Extended Data Fig. 6). In control experiments, reaction kinetics 
were not perturbed by the presence of an unlabelled competitor DNA 
lacking PAMs and bearing no sequence relationship to the crRNA, 
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Figure 2 | Cas9-RNA remains bound to cleaved products and localizes to 
PAM-rich regions during the target search. a, Wild-type Cas9-RNA bound 
to DNA curtains. b, Cleavage yield of 25 nM plasmid DNA is proportional 

to [Cas9-RNA]. c, Schematic of a double-tethered DNA curtain?*’. 

d, Liberation of the cleaved DNA with 7 M urea; asterisks denote QDs that are 
attached to the lipid bilayer but not bound to the DNA. e, Kymographs 
illustrating distinct binding events. f, Survival probabilities (mean + 70% 
confidence intervals) for non-target binding events. Solid lines represent 
double-exponential fits; n = 502 (0mM KC]), n = 632 (25 mM KCl), and 

n = 504 (100 mM KCl). Inset: survival probabilities of DNA-bound apo-Cas9 
(n = 233) and target DNA-bound Cas9-RNA (n = 104). g, Distribution of 
Cas9-RNA binding events (n = 2,330) and PAM sites; error bars represent 95% 
confidence intervals. Colour-coding reflects the binding dwell time (t;) 
relative to the mean dwell time (#). h, Correlation of PAM distribution and 
non-target Cas9-RNA binding distribution for (2 (blue) and spacer 2 (green) 
guide RNAs. 


whereas a competitor containing a PAM and fully complementary tar- 
get sequence substantially reduced the cleavage rate of the radiolabelled 
substrate (Fig. 3b). 

A series of competitors were tested that bore no crRNA guide sequence 
complementarity (Extended Data Table 1) but contained increasing num- 
bers of PAMs (Fig. 3c). There was a direct correspondence between the 
number of PAMs and the ability ofa DNA competitor to interfere with 
target cleavage, indicating that the lifetime of Cas9-RNA on compet- 
itor DNA increased with PAM density (Fig. 3c). This result persisted 
over a range of competitor DNA concentrations (Extended Data Fig. 6), 
and the same pattern of competition was observed for DNA binding 
by dCas9-RNA (Extended Data Fig. 7). These results demonstrate that 
the residence time of Cas9-RNA on non-target DNA lacking PAMs is 
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negligible, and support the hypothesis that transient, non-target DNA 
binding events observed on the DNA curtains probably occurred at 
PAM sequences. Although Cas9-RNA complexes undoubtedly sam- 
ple DNA lacking PAMs, these rapid binding events are neither detect- 
able in single-molecule and bulk binding experiments (Extended Data 
Fig. 7), nor do they appreciably influence overall reaction kinetics in 
bulk biochemical assays. 

We repeated the competition assay with a competitor bearing per- 
fect complementarity to the crRNA, but with a single point mutation 
in the adjacent PAM (5’-TCG-3')—like similarly mutated substrates®, 
this competitor cannot be cleaved by Cas9-RNA (Extended Data Fig. 7). 
This competitor failed to inhibit DNA cleavage by Cas9-RNA and 
behaved comparably to the non-target competitor DNA lacking PAMs, 
despite the fact that it contained perfect complementarity to the crRNA 
(Fig. 3c). Together, these results demonstrate that PAM recognition is 
an obligate first step during target recognition by Cas9-RNA, as previ- 
ously proposed’. 


Mechanism of RNA-DNA heteroduplex formation 


After PAM recognition, Cas9-RNA must destabilize the adjacent duplex 
and initiate strand separation to enable base pairing between the target 
DNA strand and the crRNA guide sequence. Because Cas9 has no energy- 
dependent helicase activity, the mechanism of local DNA unwinding 
has been enigmatic, but must rely on thermally available energy. One 
possibility is that PAM binding could induce a general destabilization 
of the duplex along the length of the entire target sequence, leading to 
random nucleation of the RNA-DNA heteroduplex (Fig. 3d, top). 
Alternatively, PAM binding may cause only local melting of the duplex, 
with the RNA-DNA heteroduplex nucleating at the 3’ end of the target 
sequence next to the PAM and proceeding sequentially towards the 
distal 5’ end of the target sequence (Fig. 3d, bottom). 

To distinguish between these models, we conducted cleavage assays 
with a panel of DNA competitors in which the length and position of 
complementarity to the guide RNA was systematically varied (Extended 
Data Table 1). These competitors were designed to distinguish between 
the random nucleation and sequential unwinding models for hetero- 
duplex formation based on the predicted patterns of cleavage inhibi- 
tion for each model (Fig. 3d). The ability ofa competitor DNA to inhibit 
substrate cleavage by Cas9-RNA increased as the extent of comple- 
mentarity originating at the 3’ end of the target sequence adjacent to 
the PAM increased (Fig. 3e). Inhibition increased markedly when 12 
or more base pairs were complementary to the crRNA guide sequence, 
which agrees with the requirement for an 8-12-nucleotide seed sequence 
for the Cas9-RNA DNA cleavage reaction®’*. Notably, although com- 
petitors containing mismatches to the crRNA at the 5’ end of the target 
sequence competed effectively for Cas9-RNA binding, competitors 
containing mismatches to the crRNA at the extreme 3’ end immedi- 
ately adjacent to the PAM were completely inert to binding (Fig. 3e). 
This was true even with a 2-bp mismatch followed by 18 bp of contig- 
uous sequence complementarity to the crRNA. Therefore, when mis- 
matches to the crRNA are encountered within the first two nucleotides 
of the target sequence, Cas9-RNA loses the ability to interrogate and 
recognize the remainder of the DNA. The pattern of inhibition observed 
with the different competitor DNAs indicates that sequence homology 
adjacent to the PAM is necessary to initiate target duplex unwinding 
until the reaction has proceeded sufficiently far (~12 bp, approxi- 
mately one turn of an A-form RNA-DNA helix), such that the energy 
necessary for further propagation of the RNA-DNA heteroduplex 
falls below the energy needed for the reverse reaction. These findings 
indicate that formation of the RNA-DNA heteroduplex initiates at the 
PAM and proceeds through the target sequence by a sequential, step- 
wise unwinding mechanism consistent with a Brownian ratchet*’. 

Asa further test of this model, we used a DNA competitor that con- 
tained mismatches to the crRNA at positions 1-2 but was itself mis- 
matched at the same two positions, forming a small bubble in the duplex. 
Despite the absence of sequence complementarity to the crRNA within 
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the DNA bubble, this substrate was a robust competitor and bound 
Cas9-RNA with an affinity nearly indistinguishable from that of a per- 
fect substrate (Fig. 3e and Extended Data Fig. 8). Notably, this DNA 
could also be cleaved with near-wild-type rates (Extended Data Fig. 8). 
We speculate that the presence of the DNA bubble allowed Cas9-RNA 
to bypass the mismatches and reinitiate nucleation of the RNA-DNA 
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heteroduplex downstream of the bubble, thereby propagating strand 
separation through the remainder of the target. 


The PAM triggers Cas9 nuclease activity 


The results presented above indicate that PAM recognition has a cen- 
tral role in target recognition, and that introduction ofa small bubble in 
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Figure 4 | PAM recognition regulates Cas9 nuclease activity. a, Cleavage 
assay with single-stranded, double-stranded and partially double-stranded 
substrates. b, Relative affinities and cleavage rates (mean + s.d.). c, Model for 
target search, recognition and cleavage by Cas9-RNA. The search initiates 
through random three-dimensional collisions. Cas9-RNA rapidly dissociates 
from non-PAM DNA, but binds PAMs for longer times and samples adjacent 


DNA for guide RNA complementarity, giving rise to a heterogeneous 
population of intermediates. At correct targets, Cas9-RNA initiates formation 
of an RNA-DNA heteroduplex, and R-loop expansion propagates via 
sequential unwinding. The DNA is cleaved, and Cas9-RNA remains bound to 
the cleaved products. 
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the DNA target eliminates the need for RNA-DNA heteroduplex for- 
mation immediately adjacent to the PAM. One might expect PAM 
recognition to be dispensable for Cas9-RNA-mediated recognition and 
cleavage of a single-stranded DNA (ssDNA) target. Surprisingly, how- 
ever, a ssDNA substrate was cleaved more than two orders of mag- 
nitude slower than a dsDNA substrate (Fig. 4a, b), despite the fact that 
dCas9-RNA bound both dsDNA and ssDNA substrates with similar 
affinities (Fig. 4b and Extended Data Fig. 9). 

Importantly, Cas9-RNA recognizes the 5’- NGG-3’ PAM on the non- 
target DNA strand®, so the ssDNA substrates did not contain a PAM 
but rather the complement to the PAM sequence. We proposed that the 
absence of the PAM on the ssDNA might explain why an otherwise fully 
complementary target is resistant to cleavage. To test this possibility, 
we prepared hybrid substrates with varying lengths of dsDNA at the 3’ 
flanking sequence (Fig. 4a). Cleavage assays revealed that the ssDNA 
target strand could be activated for cleavage in the presence of flanking 
dsDNA that extended across the PAM sequence, but that this activa- 
ting effect was lost when the dsDNA was truncated immediately before 
the PAM (Fig. 4a, b and Extended Data Fig. 9). Gel-shift experiments 
confirmed that these results were not a consequence of discrimination 
at the level of binding (Fig. 4b). Rather, the presence of the 5’-NGG-3’ 
PAM on the non-target strand was critical for some step of the reaction 
that occurred after binding. These data suggest that the PAM acts as an 
allosteric regulator of Cas9-RNA nuclease activity. 


Discussion 


Our results suggest a model for target binding and cleavage by Cas9- 
RNA involving an unanticipated level of importance for PAM sequences 
at each stage of the reaction (Fig. 4c). Although details may differ, we 
propose that PAM interactions may function similarly for other CRISPR 
RNA- guided surveillance complexes***. The Cas9-RNA target search 
begins with random collisions with DNA. However, rather than sam- 
pling all DNA equivalently, Cas9-RNA accelerates the search by rapidly 
dissociating from non-PAM sites, thereby reducing the amount of time 
spent at off-targets. Only upon binding to a PAM site does Cas9-RNA 
interrogate the flanking DNA for guide RNA complementarity, as was 
previously proposed for Cas9 (ref. 6) and a distinct CRISPR RNA-guided 
complex (Cascade). A requirement for initial PAM recognition also 
eliminates the potential for self-targeting, as perfectly matching targets 
within the bacterial CRISPR locus are not flanked by PAMs. Our results 
indicate that PAM recognition coincides with initial destabilization of 
the adjacent sequence, as demonstrated by experiments using a bubble- 
containing DNA substrate, followed by sequential extension of the RNA- 
DNA heteroduplex. This mechanism explains the emergence of seed 
sequences, because mismatches encountered early in a directional melting- 
in process would prematurely abort target interrogation. Moreover, the 
complex dissociation kinetics observed on non-target A-DNA would 
arise from heterogeneity in the potential target sites as Cas9-RNA 
probes sequences adjacent to PAMs for guide RNA complementarity. 
Binding to a correct target then leads to activation of both nuclease 
domains. This step also requires PAM recognition, providing an unan- 
ticipated level of PAM-dependent regulation that may ensure further 
protection against self-cleavage of the CRISPR locus. Notably, Cas9- 
RNA does not dissociate from the cleaved DNA except under extremely 
harsh conditions, indicating that Cas9-RNA may remain bound to the 
cleaved site in vivo* and require other cellular factors to promote recyc- 
ling. Finally, our data indicate that efforts to minimize off-target effects 
during genome engineering using Cas)-RNA complexes need only 
consider off-targets adjacent to a PAM, because potential targets lack- 
ing a PAM are unlikely to be interrogated***”, 


METHODS SUMMARY 

Cas9 from S. pyogenes containing a C-terminal 3 -Flag tag was purified as described’. 
crRNAs and tracrRNAs were either transcribed in vitro with T7 polymerase or 
made synthetically. Cas9-RNA complexes for single-molecule experiments were 
reconstituted by incubating Cas9 and a 10X molar excess of crRNA-tracrRNA in 
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reaction buffer (20 mM Tris-HCl pH 7.5, 100 mM KCl, 5 mM MgCl, 5% glycerol, 
1mM dithiothreitol (DTT)) for ~10 min at 37 °C, before injecting 1-2 nM into 
the flow cell. Single-tethered DNA curtains were prepared and TIRFM conducted 
as described**”’. dCas9-RNA positions were determined by fitting a two-dimensional 
Gaussian to individual molecules. Double-tethered DNA curtains were prepared 
as described*'*’, and position and lifetime measurements were determined from 
kymographs generated for each DNA molecule. Duplex DNA substrates used in 
biochemical experiments were prepared from synthetic oligonucleotides (Integrated 
DNA Technologies), and their sequences can be found in Extended Data Table 1. 
Bulk competition cleavage assays were conducted at room temperature in reaction 
buffer and contained ~1 nM radiolabelled 1.1 target DNA, 10 nM Cas9-RNA com- 
plex and 500 nM competitor DNA. Aliquots were removed at each time point and 
quenched by the addition of gel loading buffer supplemented with 50 mM EDTA. 
Reaction products were resolved by 10% denaturing polyacrylamide gel electro- 
phoresis, visualized by phosphorimaging, and quantified using ImageQuant (GE 
Healthcare). Competition experiments were analysed to determine the survival 
probability of the target DNA. After determining the conditional survival prob- 
ability for each time course, the change in survival probability relative to a reaction 
in the absence of competitor was calculated. Integration of the resulting curve over 
the total reaction time yielded the mean time spent on competitor DNA for each 
experiment. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Molecular basis of nitrate uptake by the 
plant nitrate transporter NRT1.1 


Joanne L. Parker! & Simon Newstead!” 


The NRT1/PTR family of proton-coupled transporters are responsible for nitrogen assimilation in eukaryotes and bacteria 
through the uptake of peptides. However, in most plant species members of this family have evolved to transport nitrate 
as well as additional secondary metabolites and hormones. In response to falling nitrate levels, NRT1.1 is phosphorylated 
on an intracellular threonine that switches the transporter from a low-affinity to high-affinity state. Here we present both 
the apo and nitrate-bound crystal structures of Arabidopsis thaliana NRT1.1, which together with in vitro binding and 
transport data identify a key role for His 356 in nitrate binding. Our data support a model whereby phosphorylation 
increases structural flexibility and in turn the rate of transport. Comparison with peptide transporters further reveals how 
the NRT1/PTR family has evolved to recognize diverse nitrogenous ligands, while maintaining elements of a conserved 
coupling mechanism within this superfamily of nutrient transporters. 


Nitrogen is a key element in biology, required for the synthesis of amino 
and nucleic acids and is a fundamental nutrient for cellular metabolism. 
The PTR or peptide transporter family, also known as the POT, or 
proton dependent oligopeptide transporter family, has an important 
role in nitrogen assimilation in bacteria, fungi and mammals through 
the uptake of short peptides from the environment’”. In contrast, in the 
majority of plant species nitrogen is largely obtained through the uptake 
of nitrate (NO3_) from the soil’. This is achieved through the actions 
of specific nitrate transporters in the plasma membrane of root cells* 
and is controlled through the NRT1 and NRT2 gene families that 
encode for low (Ky, ~ mM) and high (Ky ~ 11M) affinity transporters, 
respectively*®. Intriguingly, the NRT1 family of nitrate transporters 
phylogenetically belongs to the PTR family, and may have evolved from 
an ancestral peptide transport protein’. 

In plants the NRT1/PTR family, recently renamed the NPF (NRT1/ 
PTR family)’, has functionally diverged with individual members recog- 
nizing peptides, glucosinolate defence compounds, plant hormones or 
nitrate”. The NRT 1/PTR family belongs to the major facilitator super- 
family (MFS) of secondary active transporters'*"* that use the proton 
electrochemical gradient to drive substrate uptake into the cell'*"'”. MFS 
transporters operate through an alternating access mechanism, wherein 
a centrally located binding site is re-orientated to either side of the mem- 
brane to uptake and release substrate’**°. Recent crystal structures of 
bacterial members of the NRT1/PTR family have identified pairs of salt 
bridges that orchestrate the orientation of the transporter and revealed 
proton binding residues that are conserved across the family from 
prokaryotes to eukaryotes”’””. An important question therefore is how 
the binding site in this transporter family has evolved to recognize and 
transport diverse molecules while retaining elements of the same coup- 
ling mechanism. 

The Arabidopsis thaliana nitrate transporter, NRT 1.1 (NPF6.3), is 
the founding member’ of the NRT1/PTR family of transporters and 
shares significant sequence identity to mammalian and bacterial PTR 
peptide transporters (Extended Data Fig. 1). Interestingly, NRT 1.1 has 
two Ky values for nitrate’; in conditions of high nitrate availability 
(>1mM) NRT1.1 behaves as a low-affinity transporter (Ky, ~ 4mM). 
However, when nitrate levels fall below 1mM, NRT1.1 is switched 
into a high-affinity mode (Ky, ~ 40 1M). This switch occurs following 


phosphorylation of an intracellular threonine, Thr 101, by the kinase 
CIPK23™. This regulatory mechanism allows for the rapid adaption to 
changing nitrate levels before the dedicated high affinity NRT2 trans- 
porter family is expressed’*. These observations suggest a complex inter- 
play between transporter activity and post-translational modification 
at the molecular level that currently has no structural or biochemical 
basis for explanation. 


Structural basis of nitrate recognition in NRT1.1 

To study the binding of nitrate to NRT1.1 we developed a microscale 
thermophoresis assay*®”’ (Extended Data Fig. 2a) and determined the 
Kp for nitrate to be 1 + 0.15 mM (Fig. 1a and Extended Data Fig. 2b). 
A wide range of nitrate concentrations were tested (1 4M-50 mM) and 
we did not observe a high affinity state. We further tested a range of 
different ligands and show that NRT1.1 specifically recognizes both 
nitrate and chlorate, a herbicide for which the NRT1.1 protein was ori- 
ginally named (chlorate resistance 1)’ but does not recognize nitrite, 
alanine, sulphate, phosphate or the di-peptide Ala-Ala (Extended Data 
Fig. 2c). A key question surrounding the function of NRT1.1 is the 
switch between high and low affinity states. Using the phosphomimetic 
variant Thr101Asp, we investigated the effect on nitrate binding. The 
Kp for nitrate binding to the Thr101Asp variant was the same as for 
wild type (Kp = 1 0.12 mM) (Fig. 1a), indicating that phosphoryla- 
tion is unlikely to alter the nitrate-binding site directly. To follow up 
this finding and identify the nitrate-binding site we crystallized NRT1.1 
in the presence and absence of nitrate. The crystal structures were 
determined to 3.7 A (Extended Data Table 1). 

The apo structure of NRT1.1 contains 12 transmembrane (TM)- 
spanning alpha helices, consisting of amino-(TM1-TM6) and carboxy- 
terminal (TM7-TM12) bundles forming the canonical MFS fold (Fig. 1b). 
Separating these two bundles is a previously uncharacterized intracel- 
lular domain consisting of 84 amino acids. This domain is predominantly 
alpha helical and extends outwards from the transporter domain at an 
approximately 90° angle, which we have termed the lateral helix (Fig. 1c). 
At the distal end of this helix are three conserved positively charged 
residues that may help to stabilize this domain on the intracellular side 
of the membrane. It is unlikely that this domain is a crystallographic 
artefact as it does not make any crystal contacts and is sufficiently 
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mobile that we cannot model residues 270-325. NRT1.1 crystallizes as 
a dimer in the asymmetric unit cell, despite being monomeric in solu- 
tion (Extended Data Fig. 3). The dimer interface forms between TM3 
and TM6 of monomer A packing against the equivalent helices in 
monomer B and stabilized by hydrophobic interactions (Extended 
Data Fig. 3e). The buried surface area between the two monomers is 
approximately 2,136 A” and may be physiologically significant as both 
monomers are in the same orientation. Both monomers A and B adopt 
the same inward open conformation (with a root mean squared devi- 
ation (r.m.s.d.) between monomers A and B of ~ 0.69 A for 472 Cy 
atoms) with a clearly defined cavity extending outward from the mid- 
dle of the protein towards the intracellular side of the membrane. 
Surprisingly, the cavity is much larger than expected given the size of 
nitrate, being approximately 21 X 18 X 17 A in diameter (Fig. 2a). The 
extracellular side of the binding site is tightly sealed through the pack- 
ing together of TM1-TM2 against TM7-TM8. These helices form the 
extracellular gate in MFS transporters, which control access to the 
binding site from the outside of the cell’’*”*. Interestingly, there exist 
no salt bridge interactions between these helices, unlike the prokaryotic 
NRT1/PTR homologues’*”’*”; instead the extracellular gate is stabi- 
lized through extensive hydrophobic interactions (Extended Data Fig. 4). 

The nitrate-bound structure was unchanged from the apo state with 
an r.m.s.d. of approximately 0.44 A for 472 C,, atoms. However, we 
observed a significant (>30) mFo — DFc difference electron density 
peak within the central cavity, which we observed in crystals grown in 
the presence of nitrate. We therefore modelled nitrate into this density 
(Fig. 2 a, b and Extended Data Fig. 5). The nitrate molecule sits approxi- 
mately 2.8 A away from His 356 on TM7; given the pH of the crystal- 
lization condition was 4.5 His 356 is likely to be protonated and forming 
an electrostatic interaction with nitrate. An additional interaction via a 
hydrogen bond occurs with Thr 360 on TM7, which sits approximately 
29A away at the apex of the binding site. The binding of nitrate through 
an electrostatic interaction is similar, but not identical, to the recent 
crystal structure of NarU, a member of the biochemically distinct nitrate 


ARTICLE 


Figure 1 | Structural and biophysical 
characterization of NRT1.1. a, Binding isotherms 
for nitrate to both wild type (black circles) and 
Thr101Asp (red triangles) reveals no significant 
change in the Kp. Data shown is representative of 
three independent experiments and the error is 
calculated as standard deviation. b, Cartoon 
representation of the crystal structure of NRT1.1 
viewed from the extracellular side of the plasma 
membrane. Transmembrane helices (TM) 1-12 
have been coloured from blue at the amino 
terminus to red at the carboxy terminus. Thr 101 is 
shown as magenta spheres. c, NRT 1.1 viewed in the 
plane of the membrane represented as both cartoon 
and electrostatic surface, positively charged and 
aromatic residues at the end of the lateral helix are 
shown in sticks and may help to anchor this helix in 
the interfacial region of the plasma membrane. 
Residues 270-325 of the intracellular domain could 
not be modelled due to insufficient electron 
density. 


nitrite porter (NNP) family, where two conserved arginine residues act 
to coordinate nitrate through hydrogen bonds (Extended Data Fig. 6)*’. 
The other possible sites of interaction for nitrate therefore could be 
either Arg 45 (TM1) and/or Lys 164 (TM4). We mutated all three of 
these positively charged residues to alanine; however, mutation of only 
His 356 resulted in complete loss of nitrate binding, indicating an essen- 
tial role of His 356 in NRT1.1 (Fig. 2c). 


Substrate specificity in the NRT1/PTR family 


As discussed previously, members of the NRT1/PTR family can trans- 
port peptides and in the case of plant members, nitrate, hormones and 
metabolites. A key question we wished to address was how the bind- 
ing site of NRT 1.1 differed to that of the peptide transporters. Compar- 
ison with the binding site from the bacterial peptide transporter from 
Shewanella oneidensis, PepTs., reveals that the two binding sites are 
strikingly similar (Fig. 2d). The previously identified ExxER motif on 
TM1, which has an important role in coupling proton binding to pep- 
tide transport” is present in the same position, as are Lys 164 (TM4) 
and Glu 476 (TM10). As predicted for their bacterial counterparts*’”” 
Lys 164 and Glu 476 are likely to interact via a salt bridge in the out- 
ward facing conformation and may form an important link between 
the N and C-terminal bundles during transport (Extended Data Fig. 7). 

Our comparison indicates that the mechanism required for coupling 
proton movement to structural changes during transport is conserved 
regardless of substrate specificity. It is known the stoichiometry of NRT1.1 
and mammalian peptide transporters is at least 2:1 proton:solute, as 
transport is electrogenic’. It is therefore conceivable the ExxER motif 
couples one proton, leaving His 356 on TM7 to bind another proton 
and nitrate. In peptide transporters the equivalent region on TM7 has 
been shown to couple proton binding to peptide recognition” through 
an acidic residue, Asp 316 in PepTs,. One major difference between the 
binding sites is the absence of an obvious salt bridge connecting the 
extracellular gate helices TM1 and TM7. Indeed in PepTs, the arginine 
of this salt bridge occupies the same position in the protein as the 
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Figure 2 | Crystal structure of nitrate bound NRT1.1. a, Section through 
the protein volume showing the binding site in the plane of the membrane, with 
the mF, — DF, difference electron map contoured at 30 (green) identifying the 
position of nitrate. b, Zoomed in view of the binding site. Shown in sticks are the 
conserved NRT1/PTR residues and His 356 that can be seen coordinating 
nitrate within the binding site, supported by Thr 360. Dashed lines indicate a 
potential interaction network between His 356, Tyr 388 and Glu 476 in the 


nitrate (Fig. 2b, d). Another significant difference is the replacement of 
the two conserved aromatic residues on TM1 (Phe 28, Tyr 29), which 
are responsible for peptide specificity”’ with residues containing smaller 
side chains helping to create space for nitrate to bind. Given that di- 
peptides are coordinated horizontally between the N- and C-terminal 
bundles”, it is conceivable that the increased width of the cavity in 
NRT1.1 can no longer coordinate the amino and carboxy termini simul- 
taneously. Coupled with the absence of the salt bridge and tyrosines, 
which help position the peptide and coordinate movement of TM1 and 
TM7, these alterations could explain why NRT1.1 cannot recognize or 
transport peptides. 


Functional role of phosphorylation at Thr 101 


Our structures show that Thr 101 is situated at the bottom of TM3 and 
points towards a hydrophobic pocket constructed from residues in 
TM2 and TM4 (Fig. 3a). Given the position of Thr 101 it is highly likely 
that phosphorylation would cause localized structural distortion in the 
packing of TM3 with TM1 and TM4 on the intracellular side of the 
protein. Consistent with this hypothesis we discovered that the Thr101Asp 
variant of NRT1.1 was significantly less stable compared to wild-type 
protein (9 °C lower melting temperature) (Fig. 3b). To investigate the 
effect of the Thr101Asp substitution on nitrate uptake we reconstituted 
NRT1.1 into liposomes and monitored the relative uptake of nitrate 
as a function of proton movement using a pH-sensitive fluorophore 
(Extended Data Fig. 8). As NRT1.1 is a proton-coupled nitrate trans- 
porter this assay provides a reliable readout for nitrate transport. We 
discovered that compared to the wild-type protein, the Thr101Asp 
variant showed increased uptake of approximately 2.8-fold, whereas 
the His356Ala variant showed no detectable transport, consistent with 
an essential role in nitrate recognition (Fig. 3c). 

Given the high sequence conservation of the N-terminal bundle within 
the NRT1/PTR family (Extended Data Figs 1 and 9a), we mutated the 
equivalent threonine in PepT, (Thr 87) to investigate any general effects 
disrupting the packing between these helices may have on this family. 
Interestingly the Thr87Asp variant also resulted in an increase in the 
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C-terminal bundle. c, Binding isotherms showing the effect of mutating 
positively charged residues in the binding site on the affinity of NRT1.1 for 
nitrate. Data shown is representative of three independent experiments and the 
error is calculated as standard deviation. d, Similarity between the binding sites 
of the peptide transporter PepTs, and NRT1.1. The residues important for 
peptide transport are shown in sticks. The relative position of nitrate from the 
NRT1.1 structure is superimposed (green mesh). 


rate of peptide transport, whereas other variants Thr87Ala/Ser/Lys showed 
wild-type-like rates (Extended Data Fig. 9b-d), showing that this region 
is sensitive to the addition ofa negative charge. Comparable to what we 
observed for NRT1.1, the Thr87Asp variant was also less stable than 
wild-type protein by 4 °C (Extended Data Fig. 9e). However, a Thr87Arg 
variant was inactive, indicating that gross structural distortion in this 
region cannot be tolerated. This shows that disrupting the helix packing 
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Figure 3 | Proposed mechanism for the effect of phosphorylation at Thr 101 
on nitrate transport. a, Thr 101 (represented as magenta spheres) in NRT1.1 is 
situated in a hydrophobic cavity formed at the intracellular ends of TM2 and 
TM4. b, Thermal stability as assessed by the loss of alpha helical secondary 
structure using circular dichroism shows a reduction in the Thr101Asp variant 
when compared to wild-type protein. c, The Thr101Asp variant of NRT1.1 
shows a significant increase in transport when compared to wild-type protein 
in a reconstituted assay system, whereas the His356Ala mutant shows no 
activity. Data represent the mean value from eight experiments and error bars 
are standard deviation. 
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between TMs 1, 2, 3 and 4 in the N-terminal bundle of the NRT1/PTR 
family can markedly affect transport rates. Similar studies have shown 
that altering the structural flexibility within the MFS fold can affect the 
rate-limiting step of transport****. We propose that a similar mech- 
anism may occur in NRT1.1 to switch the protein between a high and 
low Ky, state. Phosphorylation causes a localized disruption of the 
N-terminal bundle helices that leads to an increased flexibility of the 
protein. This in turn leads to an increase in the transport rate, that 
would result in the lower Ky, observed in the previous study™*. 


Model for proton-coupled nitrate transport via NRT1.1 


Taken together, a working model for proton-coupled nitrate transport 
emerges from our study (Fig. 4). The protonation of His 356 is clearly a 
key component of the transport mechanism. Our model predicts that 
protonation must occur before nitrate binding and may be conditional 
upon the presence of nitrate to stabilize the charged state of His 356. 
Following closure of the extracellular gate and adoption of the occluded 
state, the release of both proton and nitrate from His 356 must be coupled 
to the opening of the intracellular gate. A possible mechanism in achiev- 
ing this is that adoption of the occluded state brings His 356 into close 
proximity to Glu 476, causing the disruption of the intracellular gate 
salt bridge (Extended Data Fig. 7). The interaction between His 356 
and Glu 476 may be facilitated by Tyr 388 (TM8), which can be seen 
making a hydrogen bond to Glu 476 in the crystal structures. The bind- 
ing of additional protons, potentially to the ExxER motif, might also 
affect the strength of the salt bridge between Lys 164 and Glu 476, facil- 
itating opening of the intracellular gate and release of bound nitrate 
and protons into the interior of the cell. Our data suggest that in NRT1.1 
the effect of phosphorylation is to cause a localized disruption in the 
N-terminal bundle that in turn may affect the position of the ExxER 
motif with respect to Lys 164. The increase in transport rate that occurs 
is possibly the result of altering the efficiency in the formation and 
breakage of the Lys 164—-Glu 476 salt bridge and release of nitrate from 
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Figure 4 | Alternating access model for nitrate-proton symport. a, In the 
outward facing state the extracellular gate, constructed from TM1-TM2 (dark 
blue) and TM7-TMS8 (yellow) is open and the intracellular gate, constructed 
from TM4-TMS (cyan) packing against TM10-TM11 (orange) is closed and 
stabilized by a salt bridge between Lys 164 (TM4) and Glu 476 (TM10), 
similarly to the bacterial homologues. b, Following protonation of His 356, 
nitrate is able to bind triggering closure of the extracellular gate. Additional sites 
of protonation exist possibly within the ExxER motif (TM1). c, Upon 
transition to the occluded state the intracellular gate salt bridge will break as a 
result of Glu 476 moving to form a new salt bridge with His 356. d, This will 
result in the release of nitrate and protons into the interior of the cell. 

e, Phosphorylation of Thr 101 (purple sphere) by CIPK23 results in increased 
flexibility within the N-terminal bundle, increasing the overall transport rate. 
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the binding site. Perhaps the most surprising finding from our study, 
however, is that only minor changes to the previously characterized 
peptide transporter binding site are required to accommodate nitrate 
and that these are predominantly located in the C-terminal bundle. 
The ability of the PTR family to recognize multiple ligands may there- 
fore stem from the separation of a fundamentally conserved proton 
coupling mechanism that resides largely in the N-terminal bundle and 
orchestrates gross conformational changes between TM4-TM5 and 
TM10-TM11, while leaving the C-terminal bundle to evolve to recog- 
nize different nitrogenous ligands. 


METHODS SUMMARY 


NRT1.1 was cloned into a modified, cleavable green fluorescent protein-histidine 
(GFP""*) fusion vector and expressed in Saccharomyces cerevisiae*®. The fusion 
protein was solubilized in 1% n-dodecyl-f-b-maltopyranoside (DDM) and puri- 
fied to homogeneity. For MST binding measurements the GFP" protein was left 
attached to the C terminus. For crystallization the GEP™"* was removed through 
tobacco etch virus protease cleavage. Crystals were grown at pH 4.5 by the vapour 
diffusion method at 4 °C. Data were collected on beamlines 102, IO3, 104 and 104-1 
at the Diamond light source, UK. The crystals were derivatized with K,Au(CN)» 
overnight and the phases calculated using single anomalous dispersion. For the 
nitrate co-complex, purification and crystallization was carried out in the presence 
of 10 mM sodium nitrate. The final apo and nitrate-bound structures were refined 
to 3.7 A resolution. MST binding studies were carried out as described in methods 
following the standard protocols for the instrument. Binding isotherm data were 
fitted to a sigmoidal dose-response equation. Nitrate uptake was assayed in pro- 
teoliposomes containing wild-type, His356Ala, Thr101Asp variants of NRT1.1 or 
no protein controls. Nitrate uptake was measured as a function of coupled proton 
movement driven by an inwardly directed proton gradient through the quenching 
of the fluorescent pH indicator dye pyranine. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Crystal structure of the plant dual-affinity 
nitrate transporter NRT1.1 


Ji Sun’, John R. Bankston’, Jian Payandeh't, Thomas R. Hinds’, William N. Zagotta” & Ning Zheng’* 


Nitrate is a primary nutrient for plant growth, but its levels in soil can fluctuate by several orders of magnitude. Previous 
studies have identified Arabidopsis NRT1.1 as a dual-affinity nitrate transporter that can take up nitrate over a wide range 
of concentrations. The mode of action of NRT1.1 is controlled by phosphorylation ofa key residue, Thr 101; however, how 
this post-translational modification switches the transporter between two affinity states remains unclear. Here we 
report the crystal structure of unphosphorylated NRT1.1, which reveals an unexpected homodimer in the inward- 
facing conformation. In this low-affinity state, the Thr 101 phosphorylation site is embedded in a pocket immediately 
adjacent to the dimer interface, linking the phosphorylation status of the transporter to its oligomeric state. Using a 
cell-based fluorescence resonance energy transfer assay, we show that functional NRT1.1 dimerizes in the cell membrane 
and that the phosphomimetic mutation of Thr 101 converts the protein into a monophasic high-affinity transporter by 
structurally decoupling the dimer. Together with analyses of the substrate transport tunnel, our results establish a 
phosphorylation-controlled dimerization switch that allows NRT1.1 to uptake nitrate with two distinct affinity modes. 


Active nitrate (NO; _) uptake by roots represents the critical first step 
of nitrogen acquisition in plants, which render the essential element to 
animals in organic forms. The abundance of nitrate in soil is affected 
by many environmental factors. As a result, soil concentrations of nitrate 
can undergo rapid changes, varying from low micromolar to high milli- 
molar concentrations. In adaptation to fluctuating nitrate levels, plants 
have evolved two complementary nitrate transporter systems with dis- 
tinct kinetic properties'*. The high-affinity transport system (HATS), 
which consists of members of the NRT2/NNP family, drives nitrate 
uptake with Michaelis constant (K,,) values in the micromolar range, 
whereas the low-affinity transport system (LATS), comprising the NRT1/ 
PTR family, transports nitrate at millimolar concentrations. In response 
to changes in nitrogen availability and demands, the activity of select 
components of these two systems can be further fine-tuned by transcrip- 
tional regulation and post-translational modifications. Transporters 
of both families belong to the major facilitator superfamily (MFS) of 
secondary active transporters and are dependent on protons for nitrate 
transport*®. 

The Arabidopsis NRT1.1 protein (also called CHL1 or NPF6.3) is 
the first nitrate transporter identified in higher plants and belongs to 
the NRT1 family®’. Distinct from most of the NRT1 and NRT2 family 
members, NRT1.1 functions as a dual-affinity transporter and con- 
tributes to both HATS and LATS*""®. In comparison to the wild-type 
plant, nrt1.1 mutants show marked nitrate uptake defects in both high- 
and low-affinity ranges. In the heterologous Xenopus oocyte expres- 
sion system, the transporter activity of NRT1.1 exhibits a characteristic 
biphasic kinetics with two different K,, values of ~50 uM and ~4mM. 
Notably, recent studies have shown that phosphorylation of a single 
residue, Thr 101, is responsible for switching NRT1.1 from the low- 
affinity to the high-affinity mode"’. Mutations of Thr 101 preventing or 
mimicking phosphorylation can effectively convert the dual-affinity 
transporter into a monophasic low-affinity or high-affinity transporter, 
respectively. Nitrate is not only a nutrient but also a signalling molecule, 
which modulates many aspects of plant physiology and optimal nitrate 


acquisition'?"'°. Independent of its transporter function, NRT1.1 also 
acts as a nitrate sensor, regulating differential expression of primary 
nitrate response genes at different nitrate levels'®'*. Remarkably, the 
sensor function of NRT1.1 also shows a biphasic pattern and is affected 
by the phosphorylation status of Thr 101. 

The MEFS permeases transport a wide spectrum of substrates and 
represent one of the largest families of secondary carriers in all species 
of life. Despite the growing number of available crystal structures of 
MES transporters'*”®, little is known about the structural mechanisms 
regulating their activities. Here we report the crystal structure of the 
full-length Arabidopsis thaliana NRT1.1, which reveals an unexpected 
phosphorylation-controlled dimerization switch that enables the trans- 
porter to operate with a dual-affinity mode. 


Overall structure of NRT1.1 


The Arabidopsis thaliana NRT1.1 gene encodes a 590-amino-acid protein, 
which is highly conserved among plant NRT1.1 orthologues, but not 
Arabidopsis NRT1 family members (Extended Data Figs 1 and 2). The 
recombinant NRT1.1 protein was overexpressed, solubilized and puri- 
fied from insect cells with dodecyl maltoside (DDM) and crystallized in 
the presence of 10 mM NaNO. With combined phases from Rosetta- 
improved molecular replacement” and single wavelength anomalous 
diffraction, we determined and refined the NRT1.1 structure at 3.25 A 
resolution (Extended Data Fig. 3 and Extended Data Table 1). 

NRT1.1 crystallized with two molecules in the asymmetric unit. 
The two copies of NRT1.1 can be superimposed with a root mean 
square deviation of 0.9 A over 504 Co atoms, indicating a common 
overall structure (Fig. 1a). As predicted, the transporter adopts a typical 
MEFS fold, which is characterized by 12 transmembrane helices (TMHs) 
with a pseudo two-fold axis relating the amino-terminal (TMH1-—6) 
and carboxy-terminal (TMH7-—12) domains (Fig. 1b). NRT1.1 is cap- 
tured in an inward-facing conformation as previously observed in the 
LacY (ref. 19), GlpT (ref. 20) and PepTg; (ref. 25) structures (Fig. lcand 
Extended Data Fig. 4). 
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Figure 1 | Crystal packing and overall structure of NRT1.1. a, Crystal 
packing of NRT1.1 in space group C2221 with two molecules in each 
asymmetric unit. b, Overall structure of NRT1.1. The N-terminal and 
C-terminal domains, the N-terminal conserved segment, the inter-domain 
linker and Pro 492 are coloured in pale green, cyan, magenta, yellow and 
orange, respectively. A functional important extracellular disulphide bond is 
indicated. c, Cutaway view showing that NRT1.1 is captured in an inward 
conformation with nitrate displayed as spheres. 


By comparing the structures of NRT1.1 and a bacterial peptide 
transporter from Streptococcus thermophilus, PepTs,, we confirm that 
eukaryotic and prokaryotic members of the NRT1/PTR family of MFS 
transporters share a similar overall architecture (Extended Data Fig. 4). 
The plant nitrate transporter, nonetheless, has three unique and con- 
served structural elements, including a well-structured N-terminal cyto- 
plasmic segment, a disulphide bond-stabilized extracellular loop, anda 
partially ordered central linker sequence (Fig. 1b). Consistent with the 
sequence divergence between the NRT1/PTR and NRT2/NNP nitrate 
transporter families, the structures of NRT1.1 and two bacterial NRT2/ 
NNP family members, NarK (ref. 24) and NarU (ref. 26), share few 
common features except for the MFS fold (Extended Data Fig. 4). 

The residue mutated (P492L) in the chl1-9 mutant, which lost the 
transporter but not the sensor function of NRT1.1 (ref. 17), is located 
at the short TMH10-TMH11 loop (Fig. 1b). Its mutation probably 
affects the structural coordination of the two helices. 


NRT1.1 dimer in the crystal 


So far, crystal structures of more than ten MFS transporters have been 
determined in the monomeric form’? ”’. The DDM-solubilized NRT1.1 
protein was also isolated in a monomeric state as determined by size- 
exclusion chromatography-coupled multi-angle light scattering mea- 
surements*’ (Extended Data Fig. 5a). A closer examination of the two 
NRT1.1 molecules in the asymmetric unit, however, reveals a possible 
biological dimer. 

In the crystal, the two adjacent non-crystallographically related NRT1.1 
molecules are juxtaposed in a side-to-side fashion with their N-terminal 
halves facing and interacting with each other (Figs 1a and 2a, b). The 
intermolecular packing is predominantly mediated by TMH3 and 
TMH6, which are located at a peripheral edge of the canonical MFS 
fold. Although crystal contacts may not always reflect biological inter- 
actions, two prominent features of the crystallographic dimer arrangement 
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Figure 2 | NRT1.1 dimer interface. a, Cylinder representation of the NRT1.1 
dimer with Thr 101 shown as red sticks. b, Extracellular view of the NRT1.1 
dimer with a central two-fold axis indicated in red. c, Two orthogonal views of 
the NRT1.1 dimer in surface representation. The dashed line represents the 
central two-fold axis. d, NRT1.1 dimer interface. The side chains of all interface 
residues are shown as spheres. e, f, Extracellular and intercellular views of 
the NRT1.1 dimer interface with TMH3 and TMH6 shown in ribbon 
representation and the side chains of interacting residues shown as sticks. The 
interface residues of one chain are labelled. 


support its physiological relevance. First, the overall topology of the 
putative NRT1.1 dimer is perfectly compatible with its transporter 
function at the membrane (Fig. 2a, b). Second, the interface between 
the two NRT1.1 molecules is extensive and complementary with a total 
surface area of ~2,160 A” (Fig. 2d-fand Extended Data Fig. 6). Overall, 
the two inward-facing NRT1.1 molecules give rise to a putative ‘in- 
phase’ dimer assembly, which is about 90 A wide and 50A thick 
(Fig. 2b). When viewed from the side, the two substrate-transporting 
tunnels are not in parallel with the central two-fold axis but slant at an 
~15° angle in opposite directions (Fig. 2c). 


Functional dimerization of NRT1.1 

To dissect the biological relevance of the NRT1.1 dimer observed in 
the crystal, we first used a crosslinking experiment to assess the poten- 
tial of detergent-solubilized NRT1.1 to dimerize in solution. Despite 
its low efficiency, an amine reactive crosslinker was able to crosslink 
NRT1.1 in a concentration-dependent manner (Fig. 3a). The cross- 
linked products migrated on SDS-polyacrylamide gel electrophoresis 
(PAGE) with a size corresponding to a NRT1.1 dimer, indicating that 
DDM-solubilized NRT1.1 is capable of forming a transient dimer in a 
membrane-free environment. 

Because solubilization by DDM might interfere with NRT1.1 dimer 
formation, we next performed fluorescence resonance energy transfer 
(FRET) spectroscopy experiments with the nitrate transporter expressed 
in the membrane of Xenopus oocytes*, which allowed us to examine 
the oligomeric state of NRT1.1 in the same lipid environment where its 
dual-affinity transporter activity has been measured. We separately 
fused the N terminus of NRT1.1 with either the mCerulean variant 
of cyan fluorescent protein (mCFP) or the mCitrine variant of yellow 
fluorescent protein (mYFP), which constitute a FRET pair with an Ro 
of ~50 A for 50% energy transfer efficiency’? (Fig. 3b). In the structure 
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Figure 3 | NRT1.1 dimerization controlled by Thr 101 phosphorylation. 
a, Crosslinking of NRT1.1 with increasing concentrations of ethylene glycol 
bis-succinimidylsuccinate (EGS). b, The design of FRET assay. Dashed lines 
indicate the 11-residue-long linkers between the fluorescence proteins and 
the structurally resolved NRT1.1 N terminus. c, FRET measurements of 
wild-type (WT) and mutant NRT1.1. The mCFP-HCN-mYFP-NRT1.1 pair 
was used as negative control. Consistent with the loss of FRET signal, the 
T101D mutant failed to be crosslinked in solution (Extended Data Fig. 5b). 
d, A close-up view of Thr 101 at the NRT1.1 dimer interface. e, Thr 101- 
interacting residues with their side chains shown as sticks. 


of the putative NRT1.1 dimer, the N-terminal ends of the two NRT1.1 
molecules are about 40 A away from each other (Extended Data Fig. 7a). 
Therefore, FRET is expected to occur if NRT1.1 dimerizes in the mem- 
brane in the same fashion as seen in the crystal structure. As shown in 
Fig. 3c, a strong FRET signal measured by a spectrum-based approach 
was detected between the co-expressed mCFP-NRT1.1 and mYFP- 
NRT1.1 fusion proteins, but not in the negative control (Extended Data 
Fig. 7b). This result strongly suggests that the plant dual-affinity nitrate 
transporter can form a homodimer not only in the crystal but also in 
the cellular membrane. 


Thr 101 phosphorylation as a dimerization switch 
The phosphorylation site residue Thr 101 is strictly conserved among 
plant NRT1.1 orthologues and represents one of the hallmarks of 
the dual-affinity nitrate transporter’? (Extended Data Fig. 1). In the 
NRT1.1 structure, Thr 101 is located at the N-terminal end of TMH3 
and is entirely buried in a hydrophobic pocket formed among TMH2, 
TMH3 and THM4 (Fig. 3d, e). Notably, this pocket is directly adjacent 
to the dimer interface with one of its walls demarcated by three hydro- 
phobic interface residues: Leu 96, Leu 100 and Ile 104 (Figs 2d and 3e). 
Although the side chain of Thr 101 is unmodified in the crystal, its phos- 
phorylation is expected to induce major electrostatic and conforma- 
tional changes in its vicinity and have a direct impact on the dimer 
interface. This structural feature prompted us to postulate that the 
formation of the NRT1.1 dimer might be determined by the phosphor- 
ylation status of Thr 101 and that the two distinct affinity states of the 
transporter might be enabled by the difference in its oligomerization 
state. 

To test this hypothesis, we compared the wild-type NRT 1.1 with the 
phosphomimetic mutant, T101D, and the phosphorylation-defective 
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mutant, T101A, in the oocyte-based FRET experiments (Fig. 3c and 
Extended Data Fig. 7b). Consistent with the prediction, the phospho- 
mimetic mutant T101D, which has been previously shown to bear a 
monophasic high-affinity nitrate transporter activity'', completely lost 
the FRET signal, indicating a spatial separation of the two NRT1.1 
N-terminal domains, if not a complete disruption of the NRT1.1 dimer. 
By contrast, the phosphorylation-defective mutant T101A, which is 
known to transport nitrate in the low-affinity state'', generated a 
robust FRET signal similar to the wild-type protein. The matching 
levels of FRET between wild-type NRT1.1 and the T101A mutant 
indicates that the wild-type protein is mostly in the unmodified form 
under the nitrate-free condition of the FRET measurement. This is in 
agreement with previous studies showing that phosphorylated NRT1.1 
only started to accumulate when plants were exposed to nitrate. 
Together, these results not only confirm the functional relevance of 
the NRT1.1 dimer, but also indicate a dimerization-based switching 
mechanism for the dual-affinity nitrate transporter—unmodified NRT1.1 
forms a structurally coupled homodimer and functions as a low-affinity 
transporter, whereas phosphorylated NRT 1.1 undergoes dimer decoup- 
ling and adopts a high-affinity state. 


Substrate binding site and proton coupling 


In both protomers of the refined NRT1.1 dimer structure, an island of 
strong electron density is present in the middle of the transport tunnel 
between the N-terminal and C-terminal domains (Fig. 4a and Extended 
Data Fig. 8). The overall location of the density coincides with the 
substrate-binding sites of other MFS structures, indicating that it 
belongs to the substrate molecule, nitrate. Indeed, when nitrate was 
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Figure 4 | Substrate binding and energy coupling in NRT1.1. a, Intracellular 
view of the nitrate-binding pocket. Nitrate is shown as a stick representation 
together with electron density contoured at 4¢ from a F, —F, map calculated 
before the nitrate was modelled in. THMs are numbered (orange). E476 is a 
His 356-interacting residue, the mutation of which abolishes the transporter 
function of NRT1.1 (Extended Data Fig. 3b). b, Side view of the putative nitrate- 
binding site and the transporter tunnel with the clustered EXXER motif and 
K164. c, Nitrate uptake activities of the H356A mutant relative to wild-type 
NRT1.1 in the presence of 10 mM or 0.1 mM nitrate. y axis is the percentage of 
nitrate uptake compared to wild type. All results are the mean + s.d. of one 
experiment in quintuplicates or sextuplicates. d, Sequence alignment of eight 
NRT1 family members from Arabidopsis thaliana in regions surrounding H356 
and F511 of NRT1.1. 
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omitted from the cryo-protection buffer, this electron density comple- 
tely disappeared from both NRT1.1 molecules (Extended Data Fig. 8). 
The position of the nitrate density is slightly shifted between the two 
NRT1.1 molecules, which might reflect the mode of substrate release 
(Extended Data Fig. 8). 

Distinct from the nitrate/nitrite binding sites of NarK and NarU, 
which coordinate the substrate(s) with two opposing conserved Arg 
residues™*”®, the nitrate-binding pocket in NRT1.1 is predominantly 
formed by hydrophobic residues, including Leu 49, Val 53, Leu 78 and 
Phe 511 (Fig. 4a, b). His 356 on TMH7 is the only polar residue that 
is in close contact with nitrate, the precise binding mode of which 
cannot be resolved due to the resolution limit of the structure. On the 
basis of its close proximity to the nitrate density and the crystallization 
condition (pH = 4.5), His 356 probably stabilizes the substrate in the 
pocket through a charge-charge interaction. Its side-chain conforma- 
tion, meanwhile, is supported by two nearby residues: Tyr 388 and 
Glu 476. Although Tyr 388 and two other polar residues, Thr 360 and 
Thr 48, are also around the substrate, their hydroxyl groups do not 
seem to be at the optimal hydrogen bond distance (Fig. 4a). 

To validate the substrate-binding site, we mutated His 356 and compared 
the nitrate uptake activities of the wild-type and mutant transporters”. 
In support of a critical role of His 356 in substrate transport, its muta- 
tion to alanine completely abolished the transport activity of NRT1.1 at 
both high and low nitrate concentration (Fig. 4c). Notably, His 356 is 
not conserved among plant NRT 1.1 orthologues and Arabidopsis NRT1 
family members, which harbour either a tyrosine or a hydrophobic 
amino acid (Leu, Met or Phe) at the equivalent position (Fig. 4d). This 
key residue, nevertheless, has closely co-evolved with the adjacent res- 
idue, Phe 511. Among all NRT1.1 orthologues and paralogues, a com- 
bination of a polar and a hydrophobic side chain has been generally 
maintained between the two residues, indicating that one of them is respon- 
sible for specific nitrate binding (Fig. 4d and Extended Data Figs 1 and 2). 
NRTI1.1 is unique among all Arabidopsis NRT1 family members by 
featuring a histidine at the nitrate-binding pocket. This charged residue 
provides a plausible explanation for the high-affinity nitrate uptake 
activity acquired by NRT1.1, which is otherwise a member of LATS. 
Furthermore, the replacement of the histidine residue by tyrosine in 
some of the plant NRT1.1 orthologues raises a question about their dual 
affinity transporter function (Extended Data Fig. 1). 

In the PepT¢, structure, a conserved motif, EXXERFXYY, on TMH1 
has been identified to have an important role in proton coupling. Part 
of this motif, EXXER, is also found in all plant NRT1.1 orthologues 
(Extended Data Fig. 1). Together with the conserved residue Lys 164, 
this motif presents a cluster of interacting residues under the nitrate- 
binding pocket and facing towards the transport tunnel (Fig. 4b). Con- 
sistent with a key function in the symport cycle, alanine mutation of 
each of the four residues abrogated the transporter activity of NRT1.1 
in the oocyte-based nitrate uptake assay (Extended Data Fig. 3b). 
Surprisingly, these four residues have been simultaneously evolved into 
non-charged residues in two Arabidopsis NRT1 family members: 
AtNRT1.5 and AtNRT1.8 (Extended Data Fig. 2). Their documented 
pH-dependent nitrate transporter activities necessitate an alternative 
proton-coupling mechanism. 


Cytoplasmic structural elements 

NRT1.1 has an ~30-amino-acid-long N-terminal cytoplasmic segment, 
which is highly conserved among its plant orthologues (Extended Data 
Figs 1 and 2). In the crystal, this sequence adopts a well-ordered loop 
structure and forms a pronounced cleft between the NRT1.1 dimer 
(Extended Data Fig. 9). With several strictly conserved residues exposed 
to the solvent, this cleft presents a putative two-fold symmetric protein— 
protein interaction site with a potential role in recruiting kinases and 
phosphatases. Although the central linker sequence is mostly disor- 
dered in the crystal, its N-terminal region forms a stable amphipathic 
a-helix (Figs 1b and 2a), providing yet another potential protein- 
docking site. 
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Figure 5 | A dimerization switch model. The non-phosphorylated and 
structurally coupled NRT1.1 dimer functions as an ‘in-phase’ homodimeric 
low-affinity nitrate transporter (right). Once phosphorylated, the NRT1.1 
dimer is decoupled, and each molecule functions as an independent high- 
affinity nitrate transporter (left). Different shapes of the putative substrate- 
binding site at the central transport tunnel reflect its differential nitrate-binding 
properties. 


Discussion 

The crystal structure of NRT1.1 reveals a biologically relevant dimer, 
the dynamic coupling and decoupling of which is controlled by the 
phosphorylation of a single residue near the dimer interface. Because 
the same post-translational modification switches the mode of action 
of the dual-affinity transporter, we propose that dimer assembly and 
disassembly enables NRT1.1 to toggle between the low-affinity and high- 
affinity states with an overlapping, if not the same, substrate-binding 
site. In this model (Fig. 5), structural engagement of two protomers at 
the interface allosterically regulates the affinity of substrate binding at 
the central transport tunnel. Whether dimer decoupling itself is sufficient 
to shift NRT 1.1 into the high-affinity mode awaits future analysis. 

Previous structural studies of several MFS members have estab- 
lished a ‘rocker-switch’ mechanism of substrate transport, in which 
the transporters cycle through outward-facing, occluded, and inward- 
facing conformations”'”’. The structure of the dimeric unmodified 
NRT1.1 reveals a buried Thr 101 phosphorylation site in the inward- 
facing conformation. Thr101 phosphorylation, therefore, probably 
occurs when the dimeric transporter adopts either the outward-facing 
or occluded conformation. It is equally possible that the unmodified 
dimeric transporter is in equilibrium with the monomeric form, which 
is susceptible to phosphorylation. 

Despite the current structure, questions remain as to how NRT1.1 
senses nitrate and transduces the signal. Our structure reveals only one 
nitrate-binding site within the substrate transport tunnel. However, 
there might be an additional nitrate-binding site responsible for sig- 
nalling, which is excluded from the inward-facing conformation. Previous 
studies have suggested that dephosphorylation of Thr 101 is required 
for the low-affinity sensor function of NRT1.1 (ref. 17). It is possible 
that dimerization mediates this signalling function of the transporter in 
the same manner as found in many common cell-surface receptors. 
The high-affinity sensor function of NRT1.1, on the other hand, might 
involve an entirely different mechanism. 

Transporter oligomerization and phosphorylation have been impli- 
cated in the proper functions of a number of MFS members, such as 
LacS (ref. 34), GLUT1 (ref. 35), TetL (ref. 36) and hRFC (ref. 37). The 
crystal structure of the NRT1.1 dimer not only establishes a structural 
framework for understanding its dual-affinity nitrate transporter and 
receptor activities, but also reveals how protein oligomerization and 
post-translational modification can synergistically expand the func- 
tional capacity of an MFS transporter. 


METHODS SUMMARY 


Detailed descriptions of the following experimental procedures can be found in 
the Methods: protein expression and purification; protein crystallization, data col- 
lection and structure determination; nitrate transporter assay in oocytes; FRET 
assay and data analysis; and details for crosslinking experiment. 
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IRAS 04368+2557 is a solar-type (low-mass) protostar embedded 
ina protostellar core (L1527) in the Taurus molecular cloud’”, which 
is only 140 parsecs away from Earth, making it the closest large star- 
forming region. The protostellar envelope has a flattened shape with 
a diameter of a thousand astronomical units (1 AU is the distance 
from Earth to the Sun), and is infalling and rotating*~. It also has a 
protostellar disk with a radius of 90 au (ref. 6), from which a plane- 
tary system is expected to form’*. The interstellar gas, mainly con- 
sisting of hydrogen molecules, undergoes a change in density of about 
three orders of magnitude as it collapses from the envelope into the 
disk, while being heated from 10 kelvin to over 100 kelvin in the mid- 
plane, but it has hitherto not been possible to explore changes in 
chemical composition associated with this collapse. Here we report 
that the unsaturated hydrocarbon molecule cyclic-C;H, resides in 
the infalling rotating envelope, whereas sulphur monoxide (SO) is 
enhanced in the transition zone at the radius of the centrifugal barrier 
(100 + 20 av), which is the radius at which the kinetic energy of the 
infalling gas is converted to rotational energy. Such a drastic change 
in chemistry at the centrifugal barrier was not anticipated, but is 
probably caused by the discontinuous infalling motion at the cen- 
trifugal barrier and local heating processes there. 

We have conducted high-spatial-resolution observations of the milli- 
metre and submillimetre wave lines of cyclic-C3H and SO in the 500 au 
region around IRAS 04368+ 2557 (Methods). Figure 1a shows the observed 
integrated-intensity map of the 523-45, rotational line of cyclic-C3H2 
(where Jxx: represents the rotational energy level of an asymmetric top 
molecule; see the footnote of Extended Data Table 1). The intensity 
distribution shows a double-peaked structure, where the southern and 
northern peaks are separated from the protostar position (the contin- 
uum peak position: Extended Data Fig. 1) by 1’’ (140 au). Figure 1b 
shows the position-velocity (PV) diagram along the north-south line 
centred at the protostar’s position. A rotation signature is clearly seen 
outward of 100 au from the protostar, where the brightest velocity 
component is redshifted and blueshifted for the northern and southern 
parts, respectively. This situation is schematically illustrated in Fig. 1c. 
Such a rotation signature abruptly disappears inward of 100 au from 
the protostar. In addition to this signature, weak blueshifted and red- 
shifted components can also be recognized in the northern and south- 
ern parts, respectively (Fig. 1b). The existence of such counter-velocity 
components indicates that the gas motion is not simple rotation but 
accompanies infalling motion, as shown in Fig. 2a. Most importantly, 
only the infalling (counter-velocity) components are seen towards the 
protostar position. This means that cyclic-C3H), is almost absent inward 
of 100 au. The other cyclic-C3H) lines (551-449, 913-827/923-8)7 and 


10 ,10-919/101,10-909) also show similar distributions (Extended Data 
Fig. 2). 

To account for this kinematic feature, we developed a toy model des- 
cribing an infalling rotating envelope, where the gas motion is approxi- 
mated by the motion of a particle for simplicity (Fig. 2a) (Methods and 
Extended Data Fig. 3). According to this model, the maximum infalling 
velocity towards the protostar position is half of the maximum rotation 
velocity at the centrifugal barrier of the infalling gas (that is, half the 
centrifugal radius). In fact, the observed maximum infalling velocity of 
cyclic-C3H, towards the protostar position is just half of the maximum 
rotation velocity (Fig. 1b). Therefore, the radius that gives the maxi- 
mum rotation velocity in the cyclic-C;H, emission is the position of the 
centrifugal barrier. 

We simulate the PV diagram by using the radius of the centrifugal 
barrier and the maximum rotation velocity as parameters. In the simu- 
lation, we consider only the velocity field, and the highest and lowest 
velocities at a given offset from the protostar (Fig. 2b) are compared 
with the PV diagram of cyclic-C;H>. The best-fitting result is shown in 
Fig. 2c. The overall behaviour of the PV diagram is well explained even 
by this simple model. From this result, the radius of the centrifugal barrier 
and the maximum rotation velocity are derived to be 100 + 20 au and 
1.8 +0.2kms |, respectively. The radius of the centrifugal barrier is 
close to the radius of the Keplerian disk (90 Av)*. The total mass of the 
protostar and the inner disk is evaluated to be 0.18 + 0.05 solar masses, 
by using the radius of the centrifugal barrier and the maximum rota- 
tion velocity derived above. Here, the error is estimated from those of 
the radius of the centrifugal barrier and the maximum rotation velocity 
by error propagation. This is consistent with the protostar mass reported 
in ref. 6 (0.19 = 0.04 solar masses) from the measurement of the Kep- 
lerian rotation. 

In contrast to the cyclic-C3H, lines, the SO (Jy = 7-67) line shows a 
compact single-peaked distribution centred at the protostar (Fig. 1a) 
(Un represents the rotational energy level of SO; see the footnote of 
Extended Data Table 1). The SO distribution seems to fill up the dip of 
the cyclic-C3H) distribution. The PV diagram of the SO line emission 
along the north-south line is completely different from that of cyclic- 
C3Hz (Fig. 1d). It consists of two components. One is a component whose 
velocity is proportional to the position offset from the protostar. The 
other is a weak and broad component whose velocity width is as broad 
as 6kms '. The anticorrelation between the distributions of cyclic- 
C3H), and SO is, thus, evident. When we compare the PV diagram of 
SO with that of cyclic-C3H,, it is most likely that the former component 
of SO comes from a rotating ring whose radius is close to the innermost 
radius of the cyclic-C,H), distribution, corresponding to the radius of 
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the centrifugal barrier (Fig. 1c). On the other hand, the latter component 
seems to come from the inner disk, although it is much fainter than the 
ring component. These features are essentially the same for the other SO 
line (Uy = 76-65). We conclude that a drastic chemical change occurs 
at the centrifugal barrier. (See the Methods and Extended Data Figs 2, 
4, 5, 6 and 7 for details.) 

We observed four lines of cyclic-C3H, and two lines of SO in total 
(Extended Data Table 1). We analysed them to derive physical conditions of 
emitting regions for both molecules by using a non-local-thermodynamic- 
equilibrium (non-LTE) large-velocity-gradient (LVG) code (Methods 
and Extended Data Fig. 8). The gas kinetic temperatures of the cyclic- 
C3H, emitting region are derived to be 23-33 K, 30 K and 23 K, for the 
1’ X 1”’ region at the centre, 1'’ north of the centre and 1’’ south of 
the centre, respectively, where the H, density ranges from 6 X 10° to 
1 X 10° cm *. These values are almost consistent with the reported model? 
(H, density of 1.3 X 10° cm * and the temperature of 30 K at 100 AU). 
On the other hand, the gas kinetic temperature of the SO-emitting region 
is constrained to be higher than 60 K, and the H, density to be higher than 
3X 10°cm * for the 1'’ X 2"’ region centred at the protostar position. 

Given that the sublimation temperature of SO is 50 K, it is most likely 
that SO is liberated into the gas phase from grain mantle at the centrif- 
ugal barrier. Although the iced SO is not identified by infrared obser- 
vations, it is predicted by the chemical model’®"’. Liberation of the S 
atom into the gas phase followed by the reaction with OH may also 
contribute to enhancement of SO (ref. 12). Possible mechanisms for 
the SO liberation are accretion shock in front of the centrifugal barrier, 
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Figure 1 | IRAS 0436842557 in 
the cyclic-C3H, (523432) and SO 
(Un = 7s-67) lines. a, Integrated 
intensity distributions of 
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outflow shock on the disk surface, and the protostellar heating. These 
possibilities are described in the Supplementary Information. 

Further in the mid-plane of the inner disk, SO is probably depleted 
again onto dust grains. The depletion timescale is as short as 10-200 years 
for the H, density of 6 X 10° cm * to reach 1 X 10° cm *. Therefore, SO 
mainly exists around the centrifugal barrier. However, the PV diagram 
of the SO emission also shows a weak broad component inward of the 
centrifugal barrier. This suggests that SO partly survives even inward 
of the barrier. The gas infalling directly onto the inner disk surface may 
also contribute to the broad component"*. We predict the velocity of the 
Keplerian rotation as a function of the radius from the estimated pro- 
tostellar mass derived from the cyclic-C3H) data (Fig. 2c). The expected 
velocity can roughly explain the PV diagram inward of the centrifugal 
barrier. 

Similarly, cyclic-C3H) is depleted onto dust grains in the mid-plane 
of the disk as in the case of SO. Cyclic-C3H2 may also be destroyed in 
the transition region by a gas-phase reaction with the oxygen atom. 
Although the fresh gas containing cyclic-C3H) is continuously supplied 
outside the centrifugal barrier, it is not supplied efficiently within the 
disk. Hence, the abundance of cyclic-C3H, in the gas phase becomes 
much lower inward of the centrifugal barrier than in the infalling rotat- 
ing envelope, as observed. 

These results demonstrate a drastic chemical change at the centri- 
fugal barrier in the course of the formation of the inner disk. It does not 
mean that SO and cyclic-C3H, are chemically related to each other directly; 
the change is caused by the discontinuous infalling motion of the gas at 
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Figure 2 | A model of an infalling rotating envelope. a, A schematic 
illustration of the infalling rotating envelope. The observer is on the left-hand 
side, looking at the envelope in an edge-on configuration. b, The highest and 
lowest velocities calculated with the toy model (Methods). The emissions of 
the coloured closed circles come from the corresponding coloured closed circles 


the centrifugal barrier and local heating processes there (Supplemen- 
tary Information). When interstellar matter is brought into the disk, it 
must experience this physical situation and is subject to significant 
chemical processing including gas—grain interactions in the transition 
zone. The chemical compositions of interstellar clouds have arbitrarily 
been assumed as the initial conditions set for the chemical evolution 
models of the disk'*"!”. This assumption has now been found to be over- 
simplified: the chemical processing in the transition zone should be 
seriously considered. This is also true for the case of tracing the origin 
of the pre-solar materials found in meteorites back to interstellar matter. 


METHODS SUMMARY 


Observations were carried out with the Atacama Large Millimeter/submillimetre 
Array (ALMA) in 2012. We observed the rotational spectral line emissions of cyclic- 
C3H, and SO as well as the dust continuum emission in the 249-GHz, 261-GHz, 
338-GHz and 351-GHz regions. The synthesized beam is about 0.8’’ X 0.7'' for 
249 GHz and 261 GHz and about 0.7’’ X 0.5’’ for 338 GHz and 351 GHz. The 
integrated intensity maps and the PV diagrams of the four cyclic-C3Hz lines and 
the two SO lines are similar to each other for each molecule. For SO, the total flux 
of the Jy = 7s—67 line emission observed with ALMA recovers 60% of that observed 
with the single-dish telescope in the Atacama Submillimetre Telescope Experiment 
(ASTE) (with a beam size of 22'’), indicating that the SO line emission is concen- 
trated around the protostar. The effect of the outflow is confirmed to be negligible 
by examining the PV diagram of the SO (Jn = 73-67) line along the outflow axis. To 
account for the PV diagrams of the C3H, lines, we use a simple kinematic model for 
the infalling rotating envelope, revealing that the maximum infalling velocity towards 
the protostar is just half of the maximum rotation velocity. We evaluated the gas 
kinetic temperature and the H density using the non-LTE LVG code. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Low-loss transmission and sensitive recovery of weak radio- 
frequency and microwave signals is a ubiquitous challenge, crucial 
in radio astronomy, medical imaging, navigation, and classical and 
quantum communication. Efficient up-conversion of radio-frequency 
signals to an optical carrier would enable their transmission through 
optical fibres instead of through copper wires, drastically reducing 
losses, and would give access to the set of established quantum opti- 
cal techniques that are routinely used in quantum-limited signal 
detection. Research in cavity optomechanics’” has shown that nano- 
mechanical oscillators can couple strongly to either microwave** or 
optical fields*’. Here we demonstrate a room-temperature optoelec- 
tromechanical transducer with both these functionalities, following 
a recent proposal* using a high-quality nanomembrane. A voltage 
bias of less than 10 V is sufficient to induce strong coupling*®’ be- 
tween the voltage fluctuations in a radio-frequency resonance circuit 
and the membrane’s displacement, which is simultaneously coupled 
to light reflected off its surface. The radio-frequency signals are 
detected as an optical phase shift with quantum-limited sensitivity. 
The corresponding half-wave voltage is in the microvolt range, 
orders of magnitude less than that of standard optical modulators. 
The noise of the transducer—beyond the measured 800 pV Hz~ 
Johnson noise of the resonant circuit—consists of the quantum noise 
of light and thermal fluctuations of the membrane, dominating the 
noise floor in potential applications in radio astronomy and nu- 
clear magnetic imaging. Each of these contributions is inferred to 
be 60 pV Hz~‘/? when balanced by choosing an electromechanical 
cooperativity of ~ 150 with an optical power of 1 mW. The noise 
temperature of the membrane is 300 K divided by the cooperativity. 
For the highest observed cooperativity of 6,800, this leads to a projected 
noise temperature of 40 mK and a sensitivity limit of 5 pV Hz’. 
Our approach to all-optical, ultralow-noise detection of classical 
electronic signals sets the stage for coherent up-conversion of low- 
frequency quantum signals to the optical domain*"’. 

Optomechanical and electromechanical systems’ have gained con- 
siderable attention recently for their potential as hybrid transducers 
between otherwise incompatible (quantum) systems, such as photonic, 
electronic and spin degrees of freedom*'®"”. The mechanical coupling 
of radio-frequency or microwave signals to optical fields is particularly 
attractive for present-day and future quantum technologies. Photon- 
phonon transfer protocols viable all the way to the quantum regime 
have already been implemented separately in the radio- and optical- 
frequency domains”’*"*. 

Among the optomechanical systems that have been considered for 
radio-to-optical transduction* °’°, we choose an approach® based ona 
nanomembrane’”” with a high quality factor, Q, ~ 3 X 10°, which is 
coupled capacitively’® to a radio-frequency resonance circuit (Fig. 1). 
Together with a four-segment gold electrode, the membrane forms 
a capacitor, whose capacitance, C,,,(x), depends on the membrane- 
electrode distance, d+ x. With a tuning capacitor Cp, the total capa- 
citance, C(x) = Cy + C,,(x), forms a resonance circuit with a typical 


quality factor Qic =\/L/C / R= 130 using a custom-made coil wired 
ona low-loss ferrite rod. This yields an inductance L = 0.64 mH anda 
loss R~ 20Q. The circuit’s resonance frequency Q1c¢=1 / VIC is 
tuned to the frequency, Q,,/2m = 0.72 MHz, of the fundamental drum 
mode of the membrane. The membrane-circuit system is coupled to a 
propagating optical mode reflected from the membrane. 

The electromechanical dynamics is described most generically by 
the Hamiltonian® 


H= t t a8 


qVac (1) 


2L 2m 2, 


Driven displacement (a.u.) 


680 700 
Frequency (kHz) V, c 7] 
2 


Figure 1 | Optoelectromechanical system. a, A 500-j1m-square membrane of 
Al-coated® SiN in vacuum (<107° mbar) forms a position-dependent 
capacitor, C,,(x = 0) ~ 0.5 pF, with a planar, four-segment gold electrode in the 
immediate vicinity (0.9 um <d<6 um). The membrane electrode’s potential 
is electrically floating. The membrane’s displacement is converted into a 
phase shift of the laser beam reflected from the membrane. b, The membrane 
capacitor is part of an LC circuit, tuned to the mechanical resonance 
frequency by means of a tuning capacitor Cy ~ 80 pF (Supplementary 
Information). A bias voltage, V4., couples the excitations of the LC circuit to 
the membrane’s motion. The circuit is driven by a voltage V,, which can be 
injected through the coupling port 2 or picked up by the inductor from the 
ambient radio-frequency (RF) radiation. c, For d = 1 um, the optically observed 
response of the membrane to a weak excitation of the system shows a split 
peak due to strong electromechanical coupling. a.u., arbitrary units. 
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where ¢ and q, respectively the flux in the inductor and the charge on 
the capacitors, are conjugate variables for the LC circuit, and x and 
p respectively denote the position and momentum of the membrane 
with an effective mass m. The last two terms represent the charging 
energy, Uc(x), of the capacitors, which can be offset using an externally 
applied bias voltage, Vac (Fig. 1). This energy, corresponding to the 
charge q= Va-C(X), leads to a new equilibrium position, x, for the 
membrane. Furthermore, the position-dependent capacitive force 
F(x) = —dUc/dx causes spring softening, reducing the membrane’s 
motional eigenfrequency by AQ,.~ — C’(X) Vi. jf 2mQ,y (ref. 19). 

Much richer dynamics than this shift may be expected from the 
mutually coupled system described by equation (1). For small excur- 
sions, (5g, 6x), around the equilibrium, (q,x), it can be described by the 
linearized interaction term* (Supplementary Information) 


6q bx (2) 


parameterized by either the coupling parameter G= — VacC'(x) /C(x) 
or the electromechanical coupling energy, /igem, where h is Planck’s 
constant (h) divided by 2x. This coupling leads to an exchange of energy 
between the electronic and mechanical subsystems at the rate g..,; if this 
rate exceeds their dissipation rates, respectively I,¢ = Q)¢/Qrc and 
Din = Qy/Qew they hybridize into a strongly coupled electromech- 
anical system**’. Our system is deeply in the strong coupling regime 
(2gem = 27 x 27 kHz > Pc = 22 x 5.5 kHz>Ip = 22 X 20 Hz) fora 
distance d = 1 um anda bias voltage V4. = 21 V (Fig. 1c). Here we detect 
the strong coupling using an independent optical probe on the mechanical 
system. 

We have performed a series of experiments in which the bias voltage 
is systematically increased with a different sample, a larger distance, 
d= 5.5 um, and a lower mechanical dissipation, [),,/2m = 2.3 Hz. The 
system is excited inductively through port 2 (Fig. 1b), inducing a weak 
radio-frequency signal of root mean squared amplitude V, = 670 nV, 
at a frequency 2 ~ Qyc. The response of the coupled system can be 
measured both electrically, as the voltage across the capacitors (port 1 
in Fig. 1b), and optically, by analysing the phase shift of a light beam 
(wavelength 2 = 633 nm) reflected from the membrane. Both signals 
are recorded with a lock-in amplifier, which also provides the excita- 
tion signal. 

The electrically measured response (Fig. 2a) shows the signature of a 
mechanically induced transparency”, indicated by the dip in the LC 
resonance curve. Independently, we observe the radio-frequency sig- 
nal in the LC circuit optically via the membrane mechanical dynamics 
(Fig. 2b). In particular, the electromechanical coupling leads to a 
broadening of the mechanical resonance to a new effective linewidth, 
Teg =(1+Ceom) Im, where Cem is the electromechanical cooperativity: 


Hy = Gdgdx =hgem 


Agen 
The width of the induced transparency dip and the mechanical line- 
width grow in unison, and in agreement with our expectations, as 
Depp Vie (Fig. 2b, inset). Both of these features also shift to lower 
frequencies as the bias voltage is increased, following the expected 
AQ — Vj. dependence’’. We note that in each experiment we have 
tuned the LC resonance frequency to Qy. 

Using the model based on the full Langevin equations (Supplemen- 
tary Information), derived from the Hamiltonian in equation (1), we 
fit the electronically and optically measured curves, and for the two 
curves obtain fit parameters 2,4, Q1c, [.c and G that agree typically to 
within 1%. Together with the intrinsic damping determined indepen- 
dently from thermally driven spectra, the system’s dynamics can be 
quantitatively predicted. Our data analysis allows us to quantify the 
coupling strength in three independent ways: analysis of the mech- 
anical responses’ spectral shape; comparison of the voltage and dis- 
placement modulation amplitudes; and in terms of the frequency 
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Figure 2 | Mechanically induced transparency. Response of the coupled 
system to a weak excitation at frequency Q (through port 2 in Fig. 1b) 
probed through the voltage modulation in the LC circuit (at port 1) (a) and 
through the optical phase shift (b). The data (coloured dots) measured for 
five different bias voltages agree excellently with model fits (curves) respectively 
corresponding to g.m/2m = 280, 470, 810, 1,030 and 1,290 Hz (from bottom 
to top). Each curve is offset so that its baseline corresponds to the Vac 
indicated between the panels. Grey points indicate Q,, values extracted for 
each set of data. A shift of AQ,, oc — V3, is fitted with the dashed line. 

Inset, effective linewidth of the mechanical resonance extracted from full 
model fits to the electrically (circles) and optically (boxes) measured response 
and simple Lorentzian fits to the optical data (diamonds). The solid line 
shows the expected Derg Vie scaling. 


shift’? of the mechanical mode. Finally, we compare these experi- 
mental values with a theoretical estimate accounting for the geometry 
of the electromechanical transducer. For Vg. = 125 V, we find that 
G=10.3kVm * following the first method, and similar values using 
the three others (Supplementary Information), demonstrating our thorough 
understanding of the system. 

In another experimental run (d = 4.5 jim; Fig. 3), we characterized 
the strong electromechanical coupling’** using the normal-mode 
splitting that gives rise to an avoided crossing of the resonances of 
the electronic circuit and the mechanical mode, as the latter is tuned 
through the former using the capacitive spring effect’. In contrast to 
earlier observations**’, we simultaneously witness the strong coupling 
through the optical readout, in which the recorded light phase repro- 
duces the membrane motion (Fig. 3c, e). Again, the predictions derived 
from the Langevin equations are in excellent agreement with our obser- 
vations, yielding a cooperativity of Cam =3,800 for these data with 
m = 24ng and I,,/2n = 3.1 Hz. 

We now turn to the performance of this interface as a radio- 
frequency/optical transducer. A relevant figure of merit for the pur- 
pose of bringing small signals onto an optical carrier is the voltage, V,,, 
required at the input of the series circuit to induce an optical phase shift 
of nm. Achieving minimal V,, requires a balance between strong coup- 
ling and induced mechanical damping. For the optimal cooperativity, 
Cem = 1, we find that 


1 
Vi = 2 V/ MLD ml cAQ, = 140 pV (4) 


at resonance (Q;=Qy=2,c=Q), which is orders of magnitude 
below the corresponding figure of merit for not only commercial 
modulators optimized for decades by the telecom industry, but also 
explorative microwave photonic devices*'”* based on electronic non- 
linearities. It is interesting to relate this performance to more funda- 
mental entities, namely the electromagnetic field quanta that constitute 
the signal. Indeed it is possible to show that the quantum conversion 
efficiency, defined here as the ratio of optical sideband photons to the 
radio-frequency quanta extracted from the source, V,I/hQic, for Con>>1, 
is given by (Supplementary Information) 


Dear 
Neo = A(kXzp¢)° as, 


= (5) 
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Figure 3 | Strong-coupling regime. a, Measured coherent coupling rate, 


28em/2T, as a function of bias voltage (points), and linear fit (line). The shaded 
area indicates the dissipation rate, 7,-/2m ~ 5.9 kHz of the LC circuit. 

b-e, Normalized response of the coupled system as measured on port 1 

(Fig. 1c) (b, d) and via the optical phase shift induced by membrane 
displacements (c, e). The colour scales encode normalized voltage (b) and 
displacement modulation (c). On tuning of the bias voltage, the mechanical 
resonance frequency is tuned through the LC resonance, but owing to the 
strong coupling an avoided crossing is very clearly observed. Panels d and 

e show the spectra corresponding to the orange lines in b and, at Va. = 242 V, 
where the electronic and mechanical resonance frequencies coincide. Points are 
data; orange line is the model fit. 


This corresponds to the squared effective Lamb-Dicke parameter, 
(kxzpp) = (2n/1)°h/2MQm, enhanced by the number of photons sam- 
pling the membrane during the membrane excitations’ lifetime (®.a, is 
the photon flux and k is the wavenumber). For the experiments shown in 
Fig. 2, we deduce a conversion efficiency of 0.8% from the independently 
measured radio-frequency voltage and optical phase modulation. 
Although this result is limited by the optical power in this interfero- 
meter, we performed tests to confirm that the membranes can support 
optical readout powers of more than ®,,,hc/2 = 20 mW without degra- 
dation of their (intrinsic) linewidth. We thus project that conversion 
efficiencies of the order of 50% are available. Note that this transducer 
constitutes a phase-insensitive amplifier, and can thus reach conversion 
efficiencies greater than one—at the expense of added quantum noise. 

For the recovery of weak signals, the sensitivity and bandwidth of 
the interface is of greatest interest. The signal at the optical output of 
the device is the interferometrically measured spectral density of the 
optical phase, g, of the light reflected from the membrane: 


0 eff |2 i 
Sip = (2k)” | x55" (|Gyicl’Svv + Sz) +See (6) 


The voltage V, at the input of the resonance circuit (denoted here as its 
spectral density S\v) is transduced to a phase shift via the circuit’s sus- 
ceptibility, x,c, the coupling, G, the effective membrane susceptibility, 
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yt and the optical wavenumber, k (Supplementary Information). The 
sensitivity is determined by the noise added within the interface. This 


includes, in particular, the imprecision in the phase measurement ers 


and the random thermal motion of the membrane induced by the 
Langevin force (St%.). The former depends on the performance of the 


interferometric detector used and can be quantum limited (sv cp ie ). 


We demonstrate the sensitivity and the noise performance of the 
transduction scheme by measuring the noise as a function of the input 
circuit resistor and its temperature (Fig. 4). Because the home-made, 
high-Q inductor is too sensitive to the ambient radio-frequency radi- 
ation (Supplementary Information), we use a shielded commercial 
inductor (Picoelectronics) resulting in a lower value, Qc = 47, for 
these measurements. Red traces in Fig. 4a and Fig. 4b respectively 
present the optically measured noise spectrum and the corresponding 
voltage noise. On resonance, the dominant contribution is the Johnson 
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Figure 4 | Voltage sensitivity and noise. Noise characterization of the 
transducer with contributions from Johnson noise (violet), optical quantum 
phase noise (yellow) and membrane thermal noise (green). a, Optically 
measured noise (red) is well reproduced by a model vs (blue). b, Data and 
models as in a, but divided by the interface’s response function, dae ,and 
thus referenced to the voltage, V,, induced in the circuit. c, Noise temperature 
of the amplifier (errors, s.d.). It is determined using the Y-factor method, at 
the resonance frequency (dark red points), and in a 10-kHz-wide band 
around the resonance (light red points), as a function of external loading. Lines 
are the model of equation (7), broken down into contributions as in a and 

b. Inset, example of a noise temperature measurement at R, = 1,250Q. 
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noise (Siy~(800 pv)” Hz~!) of the circuit (violet). Off resonance, 
optical quantum (shot) noise (yellow) limits the phase sensitivity to 
oe =(18 nrad)* Hz~!, corresponding to membrane displacements of 
(1.5 fm)* Hz’. In this experiment, we used a home-built interfero- 
meter operating at 2 = 1,064nm and with a light power of ~1 mW 
returned from a membrane with m = 64 ng and J,,,/2n = 20 Hz. The 


square root of the phase sensitivity, ,/Sj7, can be translated into a 


voltage sensitivity limit by division by the transfer function |z 
|akx! Gy of the transducer. With the cooperativity chosen here, 
Cem = 150, this corresponds to a voltage noise level of 60 pV Hz~!/? 
within the resonant bandwidth of this proof-of-principle transducer, 
but higher powers, and more sensitive optomechanical transduction’®”* 
could readily improve this number. From our model, we furthermore 
deduce that the contribution of the thermal motion of the membrane 
adds an equal amount of voltage noise (Fig. 4b, green), such that, at this 
cooperativity, these noise contributions are balanced and their sum 
minimized to 84 pV Hz~!/?. 

Further analysis of the transducer noise has been performed by 
measurements with an additional, ‘source’, resistor, R,, in series with 
the inductor of the circuit (Fig. 4c). The input to the circuit thus con- 
sists of the Johnson noise of both resistors, (Sv) = 2kp(RTR+R,T;). 
We cool the source resistor using liquid nitrogen, and optically mea- 
sure the displacement of the membrane both at room (T, = 300 K) and 
at liquid nitrogen temperature (T, = 77 K). We can thus determine the 
amount of noise added by the transducer using the Y-factor method”. 
From equation (6), we expect to find a noise temperature of (Sup- 


plementary Information) 
1 Com 


1 1/1 
Th= 1)TR4 Tmt 7 
(. ) , Ne (= Com ( ) 


at resonance, where the three summands are due to the Johnson noise 
of the circuit’s loss, R = 60 Q, at Tp = 300 K, the membrane’s thermal 
fluctuations (T,, = 300K) and the noise in the optical readout 
(T, ~ 50 mK), respectively. We note that both the circuit’s loading, 
Ne=R,/(Rs +R), and the cooperativity, Com = RCem(Rs =0 Q)/(R; +R) 
are now functions of the source resistance. In this experiment, we vary 
the cooperativity by varying the source resistor from Cem(R; =0 Q) = 
550 to Cem(Rs =2 kQ) = 18, and find a noise temperature consistent 
overall with equation (7), with the lowest measured value reaching down 
to 24K (Fig. 4c). 

The challenge of engineering a low-loss, overcoupled electronic 
resonance circuit (y.— 1) aside, the transducer itself adds only very 
little noise (green and yellow lines in Fig. 4c, representing the second 
and third terms in equation (7), respectively). For example, at a coop- 
erativity of Co = 70 achieved with R, = 400 Q, subtracting the Johnson 
noise from the total noise yields optical quantum phase noise and 
membrane thermal noise temperatures of 4K. Remarkably, the mem- 
brane contribution, which can usually only be suppressed by cryogenic 
cooling, is strongly reduced by the cooperativity parameter (0¢ Tin /Cem). 
The highest cooperativity we have obtained is Cem = 6,800, by applying 
equation (3) to the data of Fig. 1c. This implies that membrane noise 
temperatures down to 40 mK can be expected, corresponding here to a 
voltage noise level of 5 pV Hz~'/. 

For comparison, we performed measurements with an arrangement 
of ultralow-noise operational amplifiers connected directly to port 1. 
The amplifier is based on junction field-effect transistors (JFETs) and 
combines low input voltage noise (nominally 4 nV Hz~'/?) with extre- 
mely low current noise (nominally 2.5 fA Hz~'/*), as required” for 
measurements on a relatively high source impedance, which here 
amounts to RQic~140 kQ at port 1. In practice, with a gain of 1,000, 
the best voltage sensitivity we have obtained is only S¥yy°"(Qrc) = 
(130 pV)” Hz~! over the bandwidth of the LC resonance. Similar per- 
formance levels—on a par with the transducer discussed here—are 
expected even for ideal operation of other amplifiers described in the 
scientific and technical literature (Supplementary Information). Apart 


| = 
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from being competitive with standard electronics in its noise figures, our 
transducer provides a new functionality owing to its direct compatibility 
with fibre optical communication lines. The presented optoelectrome- 
chanical transducer also compares very favourably with previous proof- 
of-principle mechanical amplifiers for radio-frequency~’ and micro- 
wave’ radiation (Supplementary Information). 

Because our transducer noise floor is well below the room- 
temperature Johnson noise from the circuit’s loss, R = 60 Q, this app- 
roach can be of particular relevance in applications where electronic 
Johnson noise is suppressed. For example, for direct electronic 
(quantum) signal transduction, the resonance circuit is overloaded 
(y4e— 1) with a cold transmission line that carries the signal of interest, 
but no Johnson noise. In radio astronomy”, highly efficient antennas 
looking at the cold sky can have noise temperatures far below room 
temperature. The usually required cryogenically cooled pre-amplifiers 
might be replaced by our transducer—a critical advantage for satellite 
missions—and extension to gigahertz frequencies should be straight- 
forward using a.c. driving*. Direct and efficient conversion of radio- 
frequency signals into optics could save substantial resources in large 
phased-array antennas. Finally, in NMR experiments including imaging, 
cooled pickup circuits can deliver a significant sensitivity improvement, 
but this approach challenges present amplifier technology*”. 


METHODS SUMMARY 


The capacitor is fabricated by standard clean-room microfabrication techniques. 
Electrodes made of gold (200 nm thick) are deposited on a glass substrate and 
structured by ion-beam etching. Each segment is 400 1m long, with 60-um gaps 
between the segments. Pillars ranging in height from 600 nm 1 pm are placed to 
define the membrane-electrode distance. The inductor is wound with Litz wires to 
ensure a high Q-factor. A variable trimming capacitor is used to tune the resonance 
frequency of the LC circuit. 

The mechanical resonator consists of a 50-nm-thick aluminium layer on top of 
a high-stress stoichiometric SiN layer with a thickness of 100 or 180 nm depending 
on the sample. The aluminium layer is deposited on top of the whole wafer after 
the membranes have been released. Photolithography and chemical etching are 
subsequently used to remove the metal from the anchoring regions and from a 
circle in the middle of the membrane. The metal layer on SiN typically causes a 
10% decrease in the eigenfrequency of the fundamental mode. 

Optical interferometry is carried out using a commercial Doppler vibrometer 
(MSA-500 Polytec) and a home-made Michelson interferometer (for the data set 
in Fig. 4). The home-made Michelson interferometer uses shot-noise-limited, 
balanced-homodyne detection with a high-bandwidth (0-75 MHz) InGaAs receiver. 
The two d.c. outputs from the detector are used to generate the differential error 
signal, which is then fed to the piezoelectric actuator in the reference arm for locking 
the interferometer. The radio-frequency output of the detector is high-pass-filtered 
and fed to a spectrum analyser to record the vibrations of the membrane. Absolute 
calibration of the mechanical amplitude is carried out through a known modulation 
of the piezoelectric actuator at a frequency close to the mechanical peak. 
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Self-assembly of hydrogen-bonded 
two-dimensional quasicrystals 
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Kenneth W. Henderson! & S. Alex Kandel! 


The process of molecular self-assembly on solid surfaces is essentially 
one of crystallization in two dimensions, and the structures that 
result depend on the interplay between intermolecular forces and 
the interaction between adsorbates and the underlying substrate’. 
Because a single hydrogen bond typically has an energy between 15 
and 35 kilojoules per mole, hydrogen bonding can bea strong driver 
of molecular assembly; this is apparent from the dominant role of 
hydrogen bonding in nucleic-acid base pairing, as well as in the 
secondary structure of proteins. Carboxylic acid functional groups, 
which provide two hydrogen bonds, are particularly promising and 
reliable in creating and maintaining surface order, and self-assembled 
monolayers of benzoic acids produce structure that depends on the 
number and relative placement of carboxylic acid groups” *. Here 
we use scanning tunnelling microscopy to study self-assembled mono- 
layers of ferrocenecarboxylic acid (FCCOOH), and find that, rather 
than producing dimeric or linear structures typical of carboxylic 
acids, FCCOOH forms highly unusual cyclic hydrogen-bonded pen- 
tamers, which combine with simultaneously formed FCCOOH dimers 
to form two-dimensional quasicrystallites that exhibit local five-fold 
symmetry and maintain translational and rotational order (without 
periodicity) for distances of more than 400 angstréms. 

Quasicrystals—first observed experimentally in 1982—exhibit long- 
range, non-periodic order, and may contain extended regions with ‘for- 
bidden’ (5, 8, 10 or 12-fold) rotational symmetry’*. The majority of 
known quasicrystals are binary or ternary metal alloys such as Al\CuMn, 
or are materials derived from these alloys by adsorption or epitaxy”’°. 
Apart from metal alloys, other quasicrystalline systems discovered include 
terpolymer systems", liquid crystals and micelles'*’’, and metal’* and 
silica’* nanoparticle assemblies. FC>COOH, then, represents an entirely 
new class of quasicrystalline materials, and is the only one in which 
molecular self-assembly results in five-fold symmetry. 

The compound ferroceneacetic acid (FCCH,COOH) differs from 
FcCOOH only by the addition of a methylene group between the fer- 
rocene and the carboxylic acid (Fig. 1a) and it shows no evidence of 
pentamer formation, instead forming monolayers of close-packed dimers. 
Molecular-resolution images of FECOOH and FeCH,COOH mono- 
layers on gold (Au(111)) surfaces acquired using scanning tunnelling 
microscopy (STM) are presented in Fig. 1b (FCCH,COOH) and Fig. 1c, d 
(FcCOOH). The FCCH,COOH monolayer is ordered and regular, 
consistent with the formation of doubly hydrogen-bonded dimers, 
followed by dense packing of these dimers into a lattice. STM resolu- 
tion of ferrocene groups in covalently bound dimers depends on the 
relative orientation between the ring system and the surface, with rings 
parallel to the surface creating much brighter features, and perpendi- 
cularly oriented ferrocene much dimmer". The presence of both bright 
and dim features in Fig. 1b shows that FE>CH,COOH, too, has a range 
of adsorption conformations. Interpretation of the monolayer as con- 
sisting of rows of dimers results in an Fe-Fe hydrogen bonding dis- 
tance of 11.3A and an intermolecular spacing of 6.7 A, consistent 
with the condensed-phase distances determined crystallographically””. 


The FCCOOH monolayer, however, has no apparent long-range peri- 
odicity. Nearly all bright molecular features are grouped into pentagons, 
though there are some dimers and other defects evident. In addition to 
areas such as these, where pentamers cover the surface with nearly a 
complete monolayer, we have also observed areas of the surface with 
lower adsorbate density; at these lower coverages, isolated pentamers 
are present (Extended Data Fig. 1a). This result is at odds with the pref- 
erence for dimer formation observed for monolayers of aliphatic and 
benzoic carboxylic acids*°, the structure of the FECH,COOH mono- 
layer shown here and the known three-dimensional crystal structure of 
solid FCCOOH (ref. 17). 

Catemers are chains of carboxylic acids in which each molecule 
hydrogen-bonds to two others; for some acids, catemers constitute either 
the lowest-energy structure or a metastable structure resulting in lattice 
polymorphism”’. The degree of hydrogen bonding is the same as for 
dimer formation, although catemers are generally disfavoured because 
of non-optimal OH: *:O bond angles, as well as for entropic reasons. 
We propose that the FECOOH pentamers observed are cyclic catemers 
(Fig. 2b). Cyclization would at first seem to create even more bond strain 
in OH:::O hydrogen bonds, and impose a higher cost in entropy. 
However, this is offset by an additional favourable interaction between 
the carbonyl group and the hydrogen atom on the second position of 


FcCH,COOH and FcCOOH acid. b, c, Rows of dimers are observed for 
FcCHzCOOH (b), whereas the structure of FC>COOH (c) is dominated 

by pentamers; both images are 205 AX205A. d, Higher-resolution, 

110A X 100A image showing submolecular structure in the F©COOH Cp 
rings; dimmer features packed between pentamers also appear more clearly 
in this image. 
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Figure 2 | Calculations and proposed structures for FECOOH dimers and 
pentamers. a, Calculated energies for FE>COOH dimers as a function of 
bond angle /COH (red) compared with a similar calculation for HCOOH 
dimers (blue). Both molecules have the same deep well for the doubly bonded 
dimer, but only FCCOOH has the additional well at /COH = 225° from 
CH: *:O hydrogen bonding. Thin vertical lines show the angles necessary 

for formation of cyclic structures with three to six FECOOH molecules. 

b, Minimum-energy structure of (FC>COOH)s calculated by DFT. c, Model 
structure, with the central pentamer surrounded by five FCCOOH dimers; the 
pentamer has Cp rings parallel to the surface and appears bright in the images, 
whereas the perpendicular Cp rings of the dimer result in much dimmer 
features. The model is compared with the image in Fig. 1d, scaled and rotated, 
and slightly stretched to correct for distortion in the STM image. 


the cyclopentadienyl (Cp) ring. Bringing these atoms into proximity 
allows for the formation of a CH’ **O hydrogen bond. Such bonds are 
typically less than 10 kJ mol’ in strength, and although they are not 
generally considered in small-molecule interactions, they are known to 
influence lattice ordering in crystals, and are important in the structure 
and function of large biomolecules’? Although CH:*:O hydrogen 
bonds are not evident in the crystal structure of FCCOOH, there is 
evidence that CH***N bonding occurs in the solid-state structure of 
FcCN (ref. 20). 

We performed density functional theory (DFT) calculations to deter- 
mine whether CH-*-O interactions were sufficient to offset the bend- 
ing of OH*:O bonds. In the first set of calculations, the geometry of 
a FcCOOH dimer with two hydrogen bonds was optimized, and the 
minimum-energy structure found to have a 124.3° angle between the 
intermolecular hydrogen bond and the C=O bond direction of either 
carbonyl. The angle “/COH was varied to determine its effect on the 
dimer energy, constraining the dimer to be bent while re-optimizing all 
other aspects of the geometry. The results are shown in Fig. 2a. The 
dimer geometry has a deep well resulting from two hydrogen bonds; as 
the angle increases, one of these bonds is broken and the other strained, 
with energy increasing until the angle reaches 195°. At this point, the 
favourable CH: :O interaction outweighs the cost of further bending 
the OH--:O bond, leading to a 2.5kJ mol | well at ZCOH = 225°. 
This closely matches the 224.8° angle necessary to form a cyclic pen- 
tamer. Furthermore, the COH bond angles for cyclic tetramers or hex- 
amers are displaced from this energy minimum, which supports the 
experimentally observed predominance of five-membered rings. We 
repeated the energy calculations using second-order Moller—Plesset 
perturbation theory with the DFT-optimized geometry, and found an 
angular dependence similar to that seen in Fig. 2, including a local 
minimum at 225°. 
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The blue dotted line in Fig. 2a shows a similar calculation for formic 
acid (HCOOH) dimers, with the curve scaled to overlay the FKCOOH 
dimer curve. The functional forms of these curves are nearly identi- 
cal between /COH = 125° and 200°, indicating that the underlying 
processes—the breaking of one hydrogen bond and the bending of the 
other—are largely independent of the rest of the molecule. The well 
caused by CH:*-O hydrogen bonding, however, is completely absent 
for formic acid, because the lone hydrogen on the formic acid carbon is 
never close enough to an oxygen to participate in any intermolecular 
interaction. 

The CH:::O well at /COH = 225° in Fig. 2a appears shallow, but 
this is in part because it is a calculation for a strained dimer rather than 
for a full cyclic pentamer. The strained dimer makes one OH: **O hydro- 
gen bond and one CH:::O bond, but leaves unbonded the C=O on 
one molecule and the COH on the other. In the cyclic pentamer, all 
C=O and COH groups participate in hydrogen bonding, potentially 
doubling the cohesive energy per molecule relative to what is shown in 
Fig. 2a. To determine more accurately the relative stability of the dimer 
versus the cyclic catemers, we performed additional DFT calculations 
to obtain energies and optimized geometries for FECOOH dimers and 
for hydrogen-bonded rings containing three to six molecules (Extended 
Data Table 1). In all cases, the pentamer was found to be more stable 
than any other cyclic structure; of these, the hexamer was the next most 
stable. The relative stability of the dimer and pentamer depends on the 
choice of functionals used for the calculation, with B3LYP and PBE 
predicting the dimer to be more stable and M06-L and M06-2X favour- 
ing the pentamer. The two structures are close enough in energy—less 
than 2kJ mol! for all functionals except M06-2X—that both should 
be observable given that kgT (kp, Boltzmann’s constant; T, temperature) 
ranges from 0.64 to 2.49kJ mol’, depending on the point between 
preparation (298 K) and imaging (77 K) when the structure becomes 
kinetically frozen. 

For FCCH,COOH molecules, the length of the additional methylene 
group prevents any interaction between the carboxyl oxygen and the Cp 
hydrogens. If other factors—for example steric packing of the ferrocene 
Cp rings—were responsible for pentamer formation, both FCCOOH 
and FCCH,COOH would aggregate into pentamers in a similar fashion. 
In contrast, the observation of dimers and a complete absence of pen- 
tamers for FC>CH,COOH provides strong corroborating evidence that 
it is the CH: :O bonds that stabilize FE>COOH pentamers. Other expla- 
nations, such as registry to the underlying surface, also do not explain 
the observations: the Au(111) substrate has hexagonal symmetry, and 
to the extent that benzene remains on the surface, it packs in a rect- 
angular lattice under our experimental conditions”. Furthermore, the 
isolated pentamers observed in Extended Data Fig. la are randomly 
oriented in two dimensions, which rules out the corrugation of the 
substrate as a driving cause of pentamer formation. 

In addition to pentamers, the F;COOH monolayer contains a number 
of other bright features that can be resolved in Fig. 1c, d, including dimers 
and groups of dimers as well as individual bright features. There is also 
a large number of less bright features between pentamers, which are 
very dim in Fig. 1c and clearer in Fig. 1d. As for the dim FeCH,COOH 
molecules in Fig. 1b, we interpret these as FE;COOH molecules with Cp 
rings tilted with respect to the surface. A possible structural model is 
shown in Fig. 2c: the central pentamer is surrounded by five dimers, which 
well match the observed dim features. This arrangement creates mul- 
tiple CH’ 1 interactions, which are known to drive ordering in similar 
systems’. The dimers thus act as ‘glue’ between adjacent pentamers— 
whereas the brighter pentamer features seem to align vertex to vertex, 
the dimers create an effectively larger pentagonal unit that then packs 
edge to edge with its neighbours. 

We note that the molecular arrangement in Fig. 1c seems to have a 
single chirality; the effect is subtle but consistent, and the chirality reverses 
for the area imaged in Fig. 1d. For neighbouring pentamers aligned 
vertex to vertex, the two pentagons are offset so that the pair has point 
symmetry but not plane symmetry; that is, the pentagons’ symmetry 
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axes may be parallel but not collinear. This observation is explained in 
the model, because the trans-configuration of the dimers in the model 
in Fig. 2c create handedness that will propagate to create long-range 
chirality in the monolayer. In some cases, monolayer defects create 
grain boundaries that interrupt the long-range structure. Both orienta- 
tion and chirality can change across these boundaries; an example is 
shown in Extended Data Fig. 5, along with diagrams to illustrate the 
chiral structure. 

The packing of pentamers to form a FCCOOH monolayer results 
in quasicrystalline structure, which is analysed in Fig. 3. In Fig. 3a, a 
405 A x 360A image of a FECOOH monolayer is low-pass-filtered 
to de-emphasize the internal structure of the pentamers and thus emphas- 
ize the longer-range order (the raw data are presented in Extended 
Data Fig. 5d). Each pentamer is then overlaid with a pentagon that— 
relative to the pentagon formed with molecules at vertices—was expanded 
in size and rotated by 180°. The image is patterned but has no apparent 
periodicity, and the pentagons and their interstitial spaces match the 
shapes in a Penrose ‘PI’ tiling” (pentagon, rhombus, boat and star), 
with only a few small gaps in evidence. A number of intermediate-range 
structures exactly mirror those of a Penrose tiling, which is drawn in 
Fig. 3b for comparison. As in the Penrose tiling, orientational ordering 
of pentagons in the experimental image is nearly perfect, with only two 
directions observed; this is also presented quantitatively in Extended 
Data Figs 1b and 2. 

Local five-fold and ten-fold symmetry of the FE>COOH monolayer 
can be seen in the two-dimensional Fourier transform in Fig. 3c and 
the two-dimensional spatial correlation function in Fig. 3d; the data 
set used to produce both of these is in Extended Data Fig. 1b. The 
Fourier transform (Extended Data Fig. 3a) has ten-fold-symmetric 
peaks at a number of radii, and both the sharpness and the symmetry 
are characteristic of quasicrystalline ordering. The spatial correlation 
function (Extended Data Fig. 3b) maps displacement vectors between 
every pair of pentagons in an image; it, too, shows ten-fold symmetry 
as well as long-range translational order. This order is quasiperiodic, 
however, because it corresponds to a sum of multiple frequencies related 
by irrational ratios: in this case, powers of the golden ratio, g = (1 + \5)/2 
(Extended Data Fig. 4). Quasiperiodicity and orientational order can 
be considered the essential properties of quasicrystals*. FCCOOH 


RSs : 2 
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Figure 3 | Comparison of a FECOOH monolayer structure to a P1 Penrose 
tiling. a, 405 A X 360 A image, overlaid with pentagons to show the long- 
range, quasicrystalline order. b, Groups of pentagons highlighted in white in 
a show some structural motifs common to both the experimental data and a 
Penrose P1 tiling; the interstitial spaces between pentamers in a also match the 
star, boat and rhombus of the P1 tiling. c, Two-dimensional Fourier transform; 
the innermost ring arises from long-range five-fold and ten-fold symmetry. 
d, This symmetry can also be seen in the two-dimensional spatial correlation 
function. 
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monolayers are also quasicrystalline by an alternative definition, which 
requires an essentially sharp diffraction pattern and the absence of 
translational periodicity”. The relatively small size of ordered regions 
(>400 A) is fairly typical for self-assembled monolayers with hexa- 
gonal or other allowed symmetry. FE;COOH might be more accurately 
described, therefore, as polyquasicrystalline, and the regions imaged 
here, as quasicrystallites; this is largely a matter of terminology, however. 

Differences between the Penrose tiling and the FECOOH monolayer 
include the small gaps described above and the presence of relatively 
more rhombuses and fewer stars, as well as the prevalence of pentagons 
with four other pentagons immediately adjacent. In a Penrose tiling, 
pentagons may have two, three or five immediate neighbours, and two 
adjacent pentagons cannot have the same number of neighbours. Repli- 
cating these constraints would be difficult with pairwise interactions, 
and this probably explains differences between the monolayer and Penrose 
structures. However, the intermediate-range order of the monolayer 
matches that of the Penrose tiling quite well. 

Maximal packing cannot be the sole driving force in the formation 
of a quasicrystalline monolayer. Higher-density packing of pentagons 
is possible with periodic lattices; indeed, other investigations of the self- 
assembly of molecules with C; symmetry uniformly have found mono- 
layers with periodic lattice structures**”’, including a complex (but still 
periodic) Archimedean tiling”*. Random packing of pentagons with the 
constraint that edges and vertices must match, however, produces 
patterns that appear much more like quasicrystals”’. The structures 
in ref. 29 are more dendritic and less dense than those we observe for 
FcCOOH. Physically, we would expect thermal motions during deposi- 
tion to allow for some reorientation, increasing the packing density 
until the overall structure either reaches a free-energy minimum or 
becomes kinetically locked. The result is similar to that predicted when a 
Metropolis algorithm is used to anneal structure after random growth”. 

The spontaneous pentamer formation observed for FCCOOH will 
probably also occur for molecules functionalized at the 3 and 4 positions— 
that is, not immediately adjacent to the COOH group; some functio- 
nalizations of the other Cp ring may also not hinder pentamer formation. 
There is potentially a large range of novel supramolecular assemblies 
based on these pentameric building blocks, in which the chemistry of 
side groups can control the interaction of pentamers and influence the 
formation of crystalline or quasicrystalline structure. 


METHODS SUMMARY 


Solutions of approximately 10 mM of FcCOOH and FcCH,COOH acid were 
prepared in an Ar-purged glovebox, using benzene as the solvent. Small quantities 
of the ferrocene derivatives were loaded into a pulsed solenoid valve and deposited 
in vacuum onto an Ar-sputtered and annealed Au(111)-on-mica sample at room 
temperature (~295 K). Once deposited, the sample was immediately transferred 
into an ultrahigh-vacuum chamber (10 '° Torr) at 77 K, and STM images were 
acquired under constant-current conditions. Image filtering (high pass in the scan 
direction, or low pass overall) was done in some cases to decrease noise, but no 
other frequency-selective filter was used. DFT calculations were performed using 
the Q-CHEM software package*!. The 6-311+ +G** basis set was used for all C, H 
and O atoms, and the LANL2TZ effective core potential basis set was used for all Fe 
atoms. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Rate of tree carbon accumulation increases 
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Forests are major components of the global carbon cycle, providing 
substantial feedback to atmospheric greenhouse gas concentrations’. 
Our ability to understand and predict changes in the forest carbon 
cycle—particularly net primary productivity and carbon storage— 
increasingly relies on models that represent biological processes 
across several scales of biological organization, from tree leaves to 
forest stands’. Yet, despite advances in our understanding of pro- 
ductivity at the scales of leaves and stands, no consensus exists about 
the nature of productivity at the scale of the individual tree*’, in 
part because we lack a broad empirical assessment of whether rates 
of absolute tree mass growth (and thus carbon accumulation) decrease, 
remain constant, or increase as trees increase in size and age. Here we 
present a global analysis of 403 tropical and temperate tree species, 
showing that for most species mass growth rate increases continu- 
ously with tree size. Thus, large, old trees do not act simply as se- 
nescent carbon reservoirs but actively fix large amounts of carbon 
compared to smaller trees; at the extreme, a single big tree can add 
the same amount of carbon to the forest within a year as is contained 
in an entire mid-sized tree. The apparent paradoxes of individual 
tree growth increasing with tree size despite declining leaf-level**° 
and stand-level’® productivity can be explained, respectively, by 
increases in a tree’s total leaf area that outpace declines in produc- 
tivity per unit of leaf area and, among other factors, age-related 
reductions in population density. Our results resolve conflicting 
assumptions about the nature of tree growth, inform efforts to under- 
tand and model forest carbon dynamics, and have additional impli- 
cations for theories of resource allocation’' and plant senescence”. 

A widely held assumption is that after an initial period of increasing 
growth, the mass growth rate of individual trees declines with increas- 
ing tree size**’*-’*. Although the results of a few single-species studies 
have been consistent with this assumption”, the bulk of evidence cited 
in support of declining growth is not based on measurements of indi- 
vidual tree mass growth. Instead, much of the cited evidence documents 
either the well-known age-related decline in net primary productivity 
(hereafter ‘productivity’) of even-aged forest stands” (in which the trees 
are all ofa similar age) or size-related declines in the rate of mass gain per 
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unit leaf area (or unit leaf mass)*"’, with the implicit assumption that 


declines at these scales must also apply at the scale of the individual tree. 
Declining tree growth is also sometimes inferred from life-history theory 
to bea necessary corollary of increasing resource allocation to reproduc- 
tion'’’®. On the other hand, metabolic scaling theory predicts that mass 
growth rate should increase continuously with tree size®, and this pre- 
diction has also received empirical support from a few site-specific 
studies®’. Thus, we are confronted with two conflicting generalizations 
about the fundamental nature of tree growth, but lack a global assess- 
ment that would allow us to distinguish clearly between them. 

To fill this gap, we conducted a global analysis in which we directly 
estimated mass growth rates from repeated measurements of 673,046 
trees belonging to 403 tropical, subtropical and temperate tree species, 
spanning every forested continent. Tree growth rate was modelled as a 
function of log(tree mass) using piecewise regression, where the inde- 
pendent variable was divided into one to four bins. Conjoined line 
segments were fitted across the bins (Fig. 1). 

For all continents, aboveground tree mass growth rates (and, hence, 
rates of carbon gain) for most species increased continuously with tree 
mass (size) (Fig. 2). The rate of mass gain increased with tree mass in 
each model bin for 87% of species, and increased in the bin that included 
the largest trees for 97% of species; the majority of increases were sta- 
tistically significant (Table 1, Extended Data Fig. 1 and Supplementary 
Table 1). Even when we restricted our analysis to species achieving the 
largest sizes (maximum trunk diameter > 100 cm; 33% of species), 94% 
had increasing mass growth rates in the bin that included the largest 
trees. We found no clear taxonomic or geographic patterns among the 
3% of species with declining growth rates in their largest trees, although 
the small number of these species (thirteen) hampers inference. Declin- 
ing species included both angiosperms and gymnosperms in seven of 
the 76 families in our study; most of the seven families had only one or 
two declining species and no family was dominated by declining spe- 
cies (Supplementary Table 1). 

When we log-transformed mass growth rate in addition to tree mass, 
the resulting model fits were generally linear, as predicted by metabolic 
scaling theory® (Extended Data Fig. 2). Similar to the results of our main 
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Figure 1 | Example model fits for tree mass growth rates. The species shown 
are the angiosperm species (Lecomtedoxa klaineana, Cameroon, 142 trees) (a) 
and gymnosperm species (Picea sitchensis, USA, 409 trees) (b) in our data 
set that had the most massive trees (defined as those with the greatest 
cumulative aboveground dry mass in their five most massive trees). Each point 
represents a single tree; the solid red lines represent best fits selected by our 
model; and the dashed red lines indicate one standard deviation around the 
predicted values. 
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analysis using untransformed growth, of the 381 log-transformed spe- 
cies analysed (see Methods), the log-transformed growth rate increased 
in the bin containing the largest trees for 96% of species. 

In absolute terms, trees 100 cm in trunk diameter typically add from 
10 kg to 200 kg of aboveground dry mass each year (depending on species), 
averaging 103 kg per year. This is nearly three times the rate for trees of 
the same species at 50cm in diameter, and is the mass equivalent to 
adding an entirely new tree of 10-20 cm in diameter to the forest each 
year. Our findings further indicate that the extraordinary growth recently 
reported in an intensive study of large Eucalyptus regnans and Sequoia 
sempervirens’, which included some of the world’s most massive indi- 
vidual trees, is not a phenomenon limited to a few unusual species. Rather, 
rapid growth in giant trees is the global norm, and can exceed 600 kg 
per year in the largest individuals (Fig. 3). 

Our data set included many natural and unmanaged forests in which 
the growth of smaller trees was probably reduced by asymmetric com- 
petition with larger trees. To explore the effects of competition, we cal- 
culated mass growth rates for 41 North American and European species 
that had published equations for diameter growth rate in the absence of 
competition. We found that, even in the absence of competition, 85% 
of the species had mass growth rates that increased continuously with tree 
size (Extended Data Fig. 3), with growth curves closely resembling those 
in Fig. 2. Thus, our finding of increasing growth not only has broad 
generality across species, continents and forest biomes (tropical, subtropical 
and temperate), it appears to hold regardless of competitive environment. 

Importantly, our finding of continuously increasing growth is com- 
patible with the two classes of observations most often cited as evidence 
of declining, rather than increasing, individual tree growth: with increas- 
ing tree size and age, productivity usually declines at the scales of both 
tree organs (leaves) and tree populations (even-aged forest stands). 

First, although growth efficiency (tree mass growth per unit leaf area 
or leaf mass) often declines with increasing tree size*'®, empirical 
observations and metabolic scaling theory both indicate that, on aver- 
age, total tree leaf mass increases as the square of trunk diameter’””*. A 
typical tree that experiences a tenfold increase in diameter will therefore 
undergo a roughly 100-fold increase in total leaf mass and a 50-100-fold 
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Figure 2 | Aboveground mass growth rates for the 403 tree species, by 
continent. a, Africa (Cameroon, Democratic Republic of the Congo); b, Asia 
(China, Malaysia, Taiwan, Thailand); c, Australasia (New Zealand); d, Central 
and South America (Argentina, Colombia, Panama); e, Europe (Spain); and 


f, North America (USA). Numbers of trees, numbers of species and percentages 
with increasing growth are given in Table 1. Trunk diameters are approximate 
values for reference, based on the average diameters of trees of a given mass. 
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Table 1 | Sample sizes and tree growth trends by continent 


Continent Number of trees Number of species Percentage of species with increasing mass growth rate in the largest trees 
(percentage significant at P< 0.05) 
Africa 15,366 37 100.0 (86.5) 
Asia 43,690 136 96.3 (89.0) 
Australasia 45,418 22 95.5 (95.5) 
Central and South America 18,530 77 97.4 (92.2) 
Europe 439,889 42 90.5 (78.6) 
North America 110,153 89 98.9 (94.4) 
Total 673,046 403 96.8 (89.8) 


The largest trees are those in the last bin fitted by the model. Countries are listed in the legend for Fig. 2. 


increase in total leaf area (depending on size-related increases in leaf 
mass per unit leaf area’?”°). Parallel changes in growth efficiency can 
range from a modest increase (such as in stands where small trees are 
suppressed by large trees)”' to as much as a tenfold decline”’, with most 
changes falling in between*”'?””. At one extreme, the net effect of a low 
(50-fold) increase in leaf area combined with a large (tenfold) decline in 
growth efficiency would still yield a fivefold increase in individual tree 
mass growth rate; the opposite extreme would yield roughly a 100-fold 
increase. Our calculated 52-fold greater average mass growth rate of 
trees 100 cm in diameter compared to those 10cm in diameter falls 
within this range. Thus, although growth efficiency often declines with 
increasing tree size, increases in a tree’s total leaf area are sufficient to 
overcome this decline and cause whole-tree carbon accumulation rate 
to increase. 

Second, our findings are similarly compatible with the well-known 
age-related decline in productivity at the scale of even-aged forest stands. 
Although a review of mechanisms is beyond the scope of this paper”, 
several factors (including the interplay of changing growth efficiency 
and tree dominance hierarchies”) can contribute to declining produc- 
tivity at the stand scale. We highlight the fact that increasing individual 
tree growth rate does not automatically result in increasing stand pro- 
ductivity because tree mortality can drive orders-of-magnitude reduc- 
tions in population density”*”°. That is, even though the large trees in 
older, even-aged stands may be growing more rapidly, such stands 
have fewer trees. Tree population dynamics, especially mortality, can 
thus bea significant contributor to declining productivity at the scale of 
the forest stand”. 

For a large majority of species, our findings support metabolic scal- 
ing theory’s qualitative prediction of continuously increasing growth 


Mass growth rate (Mg yr") 
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Figure 3 | Aboveground mass growth rates of species in our data set 
compared with E. regnans and S. sempervirens. For clarity, only the 58 
species in our data set having at least one tree exceeding 20 Mg are shown 
(lines). Data for E. regnans (green dots, 15 trees) and S. sempervirens (red dots, 
21 trees) are from an intensive study that included some of the most massive 
individual trees on Earth’. Both axes are expanded relative to those of Fig. 2. 
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at the scale of individual trees®, with several implications. For example, 
life-history theory often assumes that tradeoffs between plant growth 
and reproduction are substantial’’. Contrary to some expectations’””*, 
our results indicate that for most tree species size-related changes in 
reproductive allocation are insufficient to drive long-term declines in 
growth rates®. Additionally, declining growth is sometimes considered 
to bea defining feature of plant senescence’*. Our findings are thus rele- 
vant to understanding the nature and prevalence of senescence in the 
life history of perennial plants”’. 

Finally, our results are relevant to understanding and predicting 
forest feedbacks to the terrestrial carbon cycle and global climate system’”. 
These feedbacks will be influenced by the effects of climatic, land-use 
and other environmental changes on the size-specific growth rates and 
size structure of tree populations—effects that are already being observed 
in forests**”’. The rapid growth of large trees indicates that, relative to 
their numbers, they could play a disproportionately important role in 
these feedbacks*’. For example, in our western USA old-growth forest 
plots, trees >100 cm in diameter comprised 6% of trees, yet contrib- 
uted 33% of the annual forest mass growth. Mechanistic models of the 
forest carbon cycle will depend on accurate representation of produc- 
tivity across several scales of biological organization, including calibra- 
tion and validation against continuously increasing carbon accumulation 
rates at the scale of individual trees. 


METHODS SUMMARY 


Weestimated aboveground dry mass growth rates from consecutive diameter mea- 
surements of tree trunks—typically measured every five to ten years—from long- 
term monitoring plots. Analyses were restricted to trees with trunk diameter 
210cm, and to species having =40 trees in total and =15 trees with trunk diameter 
230 cm. Maximum trunk diameters ranged from 38 cm to 270 cm among species, 
averaging 92 cm. We converted each diameter measurement (plus an accompany- 
ing height measurement for 16% of species) to aboveground dry mass, M, using 
published allometric equations. We estimated tree growth rate as G = AM/At and 
modelled Gas a function of log(M) for each species using piecewise regression. The 
independent variable log(M) was divided into bins and a separate line segment was 
fitted to G versus log(M) in each bin so that the line segments met at the bin divi- 
sions. Bin divisions were not assigned a priori, but were fitted by the model sepa- 
rately for each species. We fitted models with 1, 2, 3 and 4 bins, and selected the 
model receiving the most support by Akaike’s Information Criterion for each 
species. Our approach thus makes no assumptions about the shape of the rela- 
tionship between G and log(M), and can accommodate increasing, decreasing or 
hump-shaped relationships. Parameters were fitted with a Gibbs sampler based on 
Metropolis updates, producing credible intervals for model parameters and growth 
rates at any diameter; uninformative priors were used for all parameters. We tested 
extensively for bias, and found no evidence that our results were influenced by 
model fits failing to detect a final growth decline in the largest trees, possible biases 
introduced by the 47% of species for which we combined data from several plots, or 
possible biases introduced by allometric equations (Extended Data Figs 4 and 5). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Sensory-motor transformations for speech occur 


bilaterally 


Gregory B. Cogan’, Thomas Thesen?”, Chad Carlson”+, Werner Doyle’, Orrin Devinsky** & Bijan Pesaran! 


Historically, the study of speech processing has emphasized a strong 
link between auditory perceptual input and motor production out- 
put'*. A kind of ‘parity’ is essential, as both perception- and pro- 
duction-based representations must form a unified interface to 
facilitate access to higher-order language processes such as syntax 
and semantics, believed to be computed in the dominant, typically 
left hemisphere**. Although various theories have been proposed to 
unite perception and production”’, the underlying neural mechan- 
isms are unclear. Early models of speech and language processing 
proposed that perceptual processing occurred in the left posterior 
superior temporal gyrus (Wernicke’s area) and motor production 
processes occurred in the left inferior frontal gyrus (Broca’s area)*”. 
Sensory activity was proposed to link to production activity through 
connecting fibre tracts, forming the left lateralized speech sensory- 
motor system’. Although recent evidence indicates that speech 
perception occurs bilaterally", prevailing models maintain that 
the speech sensory-motor system is left lateralized’’**"* and facil- 
itates the transformation from sensory-based auditory representa- 
tions to motor-based production representations’’*'*, However, 
evidence for the lateralized computation of sensory-motor speech 
transformations is indirect and primarily comes from stroke 
patients that have speech repetition deficits (conduction aphasia) 
and studies using covert speech and haemodynamic functional 
imaging'®”’. Whether the speech sensory-motor system is lateralized, 
like higher-order language processes, or bilateral, like speech per- 
ception, is controversial. Here we use direct neural recordings in 
subjects performing sensory-motor tasks involving overt speech 
production to show that sensory-motor transformations occur 
bilaterally. We demonstrate that electrodes over bilateral inferior 
frontal, inferior parietal, superior temporal, premotor and soma- 
tosensory cortices exhibit robust sensory-motor neural responses 
during both perception and production in an overt word-repetition 
task. Using a non-word transformation task, we show that bilateral 
sensory-motor responses can perform transformations between 
speech-perception- and speech-production-based representations. 
These results establish a bilateral sublexical speech sensory-motor 
system. 

To investigate the sensory-motor representations that link speech 
perception and production, we used electrocorticography (ECoG), in 
which electrical recordings of neural activity are made directly from 
the cortical surface in a group of patients with pharmacologically intrac- 
table epilepsy. ECoG is an important electrophysiological signal record- 
ing modality that combines excellent temporal resolution with good 
spatial localization. Critically for this study, ECoG data contain lim- 
ited artefacts due to muscle and movements during speech production 
compared with non-invasive methods that suffer artefacts with jaw 
movement’. Thus, using ECoG we were able to investigate directly 
neural representations for sensory-motor transformations using overt 
speech production. 

Sixteen patients with subdural electrodes (see Supplementary Figs 1 
and 2) implanted in the left hemisphere (6 subjects), right hemisphere 


(7 subjects) or both hemispheres (3 subjects) performed variants of an 
overt word repetition task designed to elicit sensory-motor activations 
(Fig. 1a, Methods and Supplementary Table 1). We observed increases 
in neural activity across the high gamma frequency range (60-200 Hz 
and above) with maximal activity across subjects between 70-90 Hz. 
High gamma activity reflects the spiking activity of populations of 
neurons during task performance”°”’. Individual electrodes showed 
one of three types of task responses: sensory-motor (S-M), production 
(PROD), or auditory (AUD) (Fig. 1b, see Methods). We found that 
AUD activity was generally localized to the superior temporal gyrus and 
middle temporal gyrus (42 out of 57 electrodes (74%); Fig. 2a, b) and 
PROD activity occurred mostly in the motor and premotor corticies, 
somatosensory cortex, and the inferior parietal lobule (98 out of 124 
electrodes (79%); Fig. 2a, b), consistent with previous models and 
results of speech-perception and -production studies'’’?””. Furthermore, 
electrical stimulation of PROD electrode locations resulted in orofacial 
movements consistent with a motor function (see Supplementary 


a Listen-speak Listen—mime Listen 
e 

Auditory 4 4 i vs 4. 
Motor — —____ ss ____ 

b “oO 1 2 3. 

Time from auditory onset (s, 
Sensory— 
motor 


“ay . a Je 


euljaseq wou 
@SEBJOU! JBMOd X 


Time from auditory onset (s) 


Figure 1 | Behavioural tasks and example neural activations. a, 16 Subjects 
were presented with an auditory consonant—vowel-consonant single-syllable 
word and instructed to perform one of three tasks on interleaved trials: 
listen-speak (listen to the word, visual prompt ‘Listen’, then after a 2-s delay 
repeat the word, visual prompt ‘Speak’); listen—mime (listen to the word, visual 
prompt ‘Listen’, then after a 2-s delay, mime speaking the word, visual prompt 
“‘Mime’); listen (passively listen to the word, visual prompt ‘: = :). Auditory 
and motor timelines are shown. b, Example time-frequency spectrograms of 
ECoG activity normalized at each frequency to the baseline power during visual 
prompt. AUD, significant activity during each task epoch with auditory stimuli; 
PROD, significant activity during both production epochs; S-M, significant 
activity during the auditory and production epochs in listen-speak and 
listen-mime tasks. 


Center for Neural Science, New York University, New York, New York 10003, USA. Department of Neurology, New York University School of Medicine, New York, New York 10016, USA. *Department of 
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a Individual 


Figure 2 | Topography of neural responses and bilateral activation. 

a, Significant task-related activations within individual subject brains for left 
(subjects $3, S4, $7, S8, S11, S14), right (S1, $2, S5, $6, $9, $12, S15), or both 
(S10, S13, $16) hemispheres. Bilateral coverage is indicated by the light blue 
box. Electrodes with significant high gamma activity (70-90 Hz) are shown for 
AUD (green), PROD (blue) and S-M (red) activations. AUD and S-M 
activations (red with green) were often present on the same electrode. 
Electrodes without significant activation are shown in grey. Triangles denote 
example activations from Fig. 1b, and squares (S16) denote example 


Fig. 3). Critically, contrary to one of the core dogmas of brain and 
language, S-M activity occurred bilaterally in the supramarginal gyrus, 
middle temporal gyrus, superior temporal gyrus, somatosensory cor- 
tex, motor cortex, premotor cortex and inferior frontal gyrus (Fig. 2a, b, 
49 electrodes; see Supplementary Table 2 and Supplementary Fig. 4) 
and was observed in all subjects (Fig. 2a). Of the 49 S-M sites, 45 
sites showed auditory activation during the ‘listen task’ (Fig. 2a, b; 
Supplementary Figs 4 and 5; 45 out of 49 electrodes (approximately 
92%)), suggesting a role in speech perception. Hemispheric dominance 
as determined by Wada testing did not correlate with the hemisphere 
of the electrode placement (77 (3) = 0.92, P = 0.34). Importantly, in three 
subjects with bilateral coverage, S-M activity was present on electrodes 
in both hemispheres (Fig. 2a, c) and the likelihood of an electrode being a 
S-M site did not differ between hemispheres (Fisher’s exact test, P = 0.31). 
These results demonstrate that S-M activity occurs bilaterally. 

Given the evidence for bilateral S-M activity, we performed a series 
of analyses and experimental manipulations to test the hypothesis that 
bilateral S-M activity is in fact sensory-motor and represents sensory— 
motor transformations for speech. 

One concern is that S-M activity is not due to sensory and motor 
processes but to sensory activation in both auditory (input) and production 
epochs (sound of your own voice). We observed several convergent 
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se.m. values across electrodes. Go cue and average production response onset 
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lines of evidence that S-M activity reflects both sensory and motor 
processing (see Fig. 2d and Methods). First, S-M sites contain a sensory 
response because they responded to auditory stimulation as rapidly as 
AUD sites (S-M latency = 158 ms, AUD = 164 ms; see Fig. 2d). Second, 
S-M responses during production are not due to auditory sensory reaf- 
ferent input from hearing one’s own voice because responses were 
present during the ‘listen—mime task’ as well as the ‘listen—speak task’. 
Third, S-M responses during production are not due to somatosensory 
reafference from moving articulators because S-M activity significantly 
increased within 248 ms of the production ‘go’ cue, whereas vocal responses 
occurred substantially later at 1,002 ms (+40 ms s.e.m.). Fourth, S-M 
production responses contain motor features because they occurred 
together with, and even before, PROD electrode responses (S-M = 248 
ms, PROD = 302 ms, Q = 0.03; permutation test; see Methods). Finally, 
S-M activity was persistently elevated during the delay period (P = 0.01; 
see Fig. 2d, Methods), broadly consistent with planning activity, unlike 
PROD delay-period activity (P = 0.64) or AUD delay-period activity 
(P = 0.53). These results demonstrate that S-M activity cannot be sim- 
ply sensory and spans both sensory and motor processes. 

A related concern is that sensory-motor transformations are first car- 
ried out in the left hemisphere. If so, S-M responses in the right hemisphere 
could be due to communication from the left hemisphere. To test this 
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hypothesis, we further examined latencies of S-M responses according 
to hemisphere. Response latencies did not differ significantly in each 
hemisphere in either the auditory (right hemisphere, 156 ms; left hemi- 
sphere, 182 ms; Q=1 X 10 *; permutation test) or the production 
epoch (right hemisphere, 272 ms; left hemisphere, 268 ms; Q=1 X 
10 *; see Methods). Therefore, right hemisphere responses cannot be 
due to computations that were first carried out in the left hemisphere and 
the data do not support strictly lateralized sensory-motor computations. 

Another concern is that S-M activity may not reflect speech proces- 
sing and may also be present during simple auditory inputs and oro- 
facial motor outputs. To test this, we employed a ‘tone-move task’ in 
one of the bilaterally implanted subjects (subject 13 (S13); see Methods). 
We found that S-M electrodes did not have significant sensory-motor 
responses during the tone-move task (P = 0.36, permutation test; see 
Supplementary Fig. 6). Thus, S-M activity is specific to mapping sounds 
to goal-directed vocal motor acts and is likely specific to speech (see 
Supplementary Discussion 1.3). 

Thus far we have shown the S-M activity is bilateral, sensory-motor, 
and likely to be specific to speech. However, an important open ques- 
tion is whether S-M responses reflect the transformation that links 
speech perception and production and can support a unified perception- 
production representational interface. A specific concern is that high 
gamma ECoG activity may pool heterogeneous neural responses 
beneath the electrode. S-M responses may combine activity from neu- 
rons which encode perceptual processes active during the auditory cue 
and other neurons which encode production processes active during 
the utterance. If this is true, none of the activity necessarily reflects a 
sensory-motor transformation that links perception and production. 
To be able to rule out this alternative and demonstrate that S-M res- 
ponses are involved in sensory-motor transformations, we reasoned 
that two requirements must be met. S-M activity must encode informa- 
tion about the content of the underlying speech processes, and this encod- 
ing must reflect transformative coding between the sensory input and 
motor output. 

To test whether S-M activity encodes information about speech 
content, we decoded the neural activity to predict, on each trial, what 
the subjects heard and said. We used seven consonant-vowel-consonant 
words (heat, hit, hat, hoot, het, hot and hut) and trained a seven- 
way linear classifier to decode the neural responses (see Methods). 
Individual electrodes only weakly encoded speech content, but when 
we decoded activity pooled across groups of electrodes, we found that 
all three electrode groups encoded speech tokens (see Fig. 3). AUD elec- 
trodes performed best with an average classification performance of 
42.7% (y? (1) = 56.5,P = 6 X 10°“), followed by S-M electrodes, which 
showed performance of 33.4% (x? (1) = 25.6, P= 4 X 10”), and then 
PROD electrodes, which showed performance of 27.1% (x? (1) = 
11.5,P=7 X 10 *). Furthermore, classification performance for S-M 
electrodes did not differ between the two hemispheres (left hemisphere, 
29%; right hemisphere, 27%; Fisher’s exact test, P = 0.5; Fig. 3c). Thus, 
bilateral S-M activity encodes information about the sensory and motor 
contents of speech, meeting an important requirement for sensory- 
motor transformations. 

We next sought to test whether S-M activity can link speech percep- 
tion and production by transforming auditory input into production 
output. The essential requirement for transformation is that neural encod- 
ing of sensory input should depend on subsequent motor output. Previous 
work has characterized visual-motor transformations using a transforma- 
tion task in which the spatial location of a visual cue can instruct a motor 
response to the same or different spatial location (the ‘pro-anti task’)”””?. 
Sensory—motor neurons in the dorsal visual stream display different 
responses to the visual cue depending on the motor contingency, demon- 
strating a role for these neurons in the visual-motor transformation”. 

Given these predictions from animal neurophysiology, we tested four 
subjects as they performed an auditory-motor transformation task (the 
listen-speak transformation task) that employed two non-words (kig, 
pob) to examine whether S-M activity has a role in transformations for 
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Figure 3 | Neural decoding of words. a, Confusion matrices show proportion 
identified for a seven-way linear classifier using neural responses. AUD 
electrodes (top), PROD electrodes (middle) and S-M electrodes (bottom) are 
shown. The threshold for performance is at chance level, P = 0.14, for the 
purposes of displaying the electrodes clearly. b, Classification performance for 
increasing numbers of electrodes. Chance performance is indicated by the 
dotted line. c, Classification performance for S-M electrodes in the left and right 
hemispheres. Methods present S-M results by response epoch. 


speech (see Fig. 4A, Supplementary Figs 7 and 8 and Methods). This 
task enabled us to hold the sensory and motor components constant 
while manipulating the transformation process itself in order to mea- 
sure how the encoding of this content changed depending on how 
perceptual input was mapped onto production output. The use of 
non-words instead of words offered other advantages. Non-words 
enabled us to examine sublexical transformations for speech and could 
be designed to differ maximally in their articulatory dimensions and 
their neural representations (see Methods and Supplementary Dis- 
cussion 1.1 and 1.2). 

At least three models describe how neural responses encode the task 
variables. If responses follow a strictly sensory model, the encoding will 
follow the content of the sensory inputs and confuse trial conditions in 
which kig is converted to kig (kig—kig) with trials in which kig is con- 
verted to pob (kig—pob), as well as trials in which pob is converted to 
pob (pob—pob) with trials in which pob is converted to kig (pob—kig) 
(see Fig. 4Ba). Conversely, responses that follow a strictly motor model 
will encode the production outputs, confusing kig—>kig with pob—kig 
trials and pob—pob with kig—pob trials (see Fig. 4Bb). If S-M responses 
pool responses from sensory and motor neurons, the encoding will 
follow the sensory model during sensory input and the motor model 
during motor output. In contrast, S-M responses that reflect the trans- 
formation of sensory input into motor output must follow a different 
transformation model and encode the sensory information differently 
depending on the upcoming motor act (see Fig. 4Bc). Neural activity 
displaying this property could compute a representational transforma- 
tion (see Supplementary Discussion 1.1, 1.2). If so, responses that follow 
a transformation model will not confuse trial conditions with either 
identical input or identical output. Consequently, each of the three models 
predicted very different patterns of neural coding. 

We constructed linear classifiers to decode neural responses. As 
expected, AUD electrodes in the auditory epoch encoded the auditory 
input (Fig. 4Ba, Ca) and PROD electrodes encoded the output during 
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Figure 4 | Listen-speak transformation task. A, In the listen—speak 
transformation task, subjects have to transform a non-word they hear into a 
non-word they speak according to a simple rule. Subjects were first 
presented with a visual cue: ‘Match Listen’ or “Mismatch Listen’ that instructed 
the rule that determined the non-word to say in response to the non-word 
they heard. On ‘match trials’ the rule was to repeat the non-word they heard. 
On ‘mismatch trials’, the rule was to say the non-word that they did 

not hear. The non-words were ‘kig’ and ‘pob’. Subjects then heard one of the 


the production epoch (utterance; Fig. 4Bb, Cb). However, S-M elec- 
trodes changed their encoding over the course of the trial. During the 
auditory epoch, S-M electrodes encoded both sensory and motor con- 
ditions concurrently, consistent with the presence of a sensory-motor 
transformation (Fig. 4Bc, Cc). Interestingly, during the production epoch, 
S-M responses no longer encoded the auditory input and encoded the 
production output (Fig. 4Cd), suggesting the transformation has lar- 
gely been computed by that time. To quantify the comparison of dif- 
ferent models, we used the Kullback—Leibler divergence (see Fig. 4Da-d, 
Methods). The results were consistent with the response patterns in the 
confusion matrices. 

We can also rule out that the difference in S-M responses is due to a 
third population of neurons that selectively responds to the cue instruc- 
ting how perceptual input was mapped onto production output (‘match’ 
or ‘mismatch’). We ran the same linear classifier during cue presentation 
and found that the S-M responses did not encode the cue (v7 (1) = 0.08, 
P =0.78; see Methods). 

Using direct brain recordings (ECoG) and overt speech, we demon- 
strate that a sensory-motor system for transforming sublexical speech 
signals exists bilaterally. Our results are in keeping with models of speech 
perception that posit bilateral processing but contradict models that 
posit lateralized sensory-motor transformations'’’*. Our results also 
highlight how S-M activity during perceptual input reflects the trans- 
formation of speech sensory input into motor output. We propose that the 
presence of such transformative activity demonstrates a unified sensory- 
motor representational interface that links speech-perception- and 
speech-production-based representations. Such an interface is import- 
ant during speech articulation, acquisition and self-monitoring ”*. As 
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two non-words, waited for a short delay, then said the appropriate 

non-word in response to the ‘Speak’ cue. There were four task conditions: 
kig—kig (hear ‘kig’ and say ‘kig’); pob—pob (hear ‘pob’ and say ‘pob’); 
kig—pob (hear ‘kig’ and say ‘pob’); and pob—kig (hear ‘pob’ and say ‘kig’). 
B, a-c, Confusion matrices predicted by the sensory, motor and transformation 
models with high and low classification scores. C, a-d, Confusion matrices 
during the listen—-speak transformation task. D, a-d, Model fit quantified using 
a Kullback-Leibler (K-L) divergence. 


right hemisphere lesions do not give rise to conduction aphasia'*””-”, 


our evidence for bilateral sensory-motor transformations promotes an 
interesting distinction between speech and language: although sens- 
ory-motor transformations are bilateral, the computational system for 
higher-order language is lateralized** (see Supplementary Fig. 9). This 
hypothesis invokes a strong interface between sensory-based speech- 
perception representations and motor-based speech-production rep- 
resentations and suggests that deficits for conduction aphasia are more 
abstract and linguistic in nature. We propose that bilateral sublexical trans- 
formations could support a unification of perception- and production- 
based representations into a sensory-motor interface’, drawing a 
distinction between the bilateral perception-production functions of 
speech and lateralized higher order language processes. 


METHODS SUMMARY 


Electrocorticographic (ECoG) recordings were obtained from 16 patients (10 females) 
undergoing treatment for pharmacologically resistant epilepsy. Each patient pro- 
vided informed consent in accordance with the Institutional Review Board at New 
York University Langone Medical Center. Grid implantation was in the left hemi- 
sphere (6 subjects), right hemisphere (7 subjects) or both hemispheres (3 subjects). 
All 16 subjects performed an overt word repetition task (listen-speak task) as well 
as two control tasks (listen-mime task’*® and listen task). One subject also per- 
formed a tone-move task. Four subjects also performed a listen-speak transforma- 
tion task involving non-words. ECoG recordings were made using both grid and 
strip electrode arrays with 2.3-mm contact size and 10-mm spacing. Spectral analysis 
was performed using 500-ms analysis windows with +5-Hz frequency smoothing 
and a stepping size of 50 ms. Neural responses were defined as high gamma neural 
activity between 70 and 90 Hz and significance was assessed using a shuffling 
procedure. Classification analyses were carried out using a linear discriminant 
analysis with high gamma power spectral features. 


6 MARCH 2014 | VOL 507 | NATURE | 97 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 25 September; accepted 2 December 2013. 
Published online 15 January 2014. 


1. Liberman, A. M., Cooper, F. S., Shankweiler, D. P. & Studdert-Kennedy, M. 
Perception of the Speech Code. Psychol. Rev. 74, 431-461 (1967). 

2. Liberman, A. M. & Mattingly, |. G. The motor theory of speech perception revised. 
Cognition 21, 1-36 (1985). 

3. Halle, M. & Stevens, K. N. Speech recognition: a model and a program for research. 
IEEE Trans. Inf. Theory 8, 155-159 (1962). 

4. Halle, M. & Stevens, K. N. Analysis by synthesis In (eds Wathen-Dunne, W. & Woods, 
L. E.) Proceedings of seminar on speech compression and processing Vol. 2 paper D7 
1959). 

5. Berwick, R.C., Friederici, A. D., Chomsky, N. & Bolhuis, J.J. Evolution, brain, and the 

nature of language. Trends Cogn. Sci. 17, 89-98 (2013). 

Chomsky, N. The Minimalist Program (MIT Press, 1995). 

Jakobson, R. Child Language, Aphasia and Phonological Universals (Mouton, 1968). 

Lichtheim, L. On aphasia. Brain 7, 433-484 (1885). 

Wernicke, C. The aphasic symptom-complex: a psychological study on an 

anatomical basis. Arch. Neurol. 22, 280-282 (1970). 

10. Geschwind, N. Disconnexion syndromes in animals and man. |. Brain 88, 237-294 
(1965). 

11. Hickok, G. & Poeppel, D. The cortical organization of speech processing. Nature 
Rev. Neurosci. 8, 393-402 (2007). 

12. Price, C. J. The anatomy of language: a review of 100 fi 
2009. Ann. NY Acad. Sci. 1191, 62-88 (2010). 

13. Obleser, J., Eisner, F. & Kotz, S. A. Bilateral speech comprehension reflects 
differential sensitivity to spectral and temporal features. J. Neurosci. 28, 
8116-8123 (2008). 

4. Rauschecker, J. P. & Scott, S. K. Maps and streams in the auditory cortex: 
nonhuman primates illuminate human speech processing. Nature Rev. Neurosci. 
12, 718-724 (2009). 

5. Hickok, G., Houde, J. & Rong, F. Sensorimotor integration in speech processing: 
computational basis and neural organization. Neuron 69, 407-422 (2011). 

6. Hickok, G., Okada, K. & Serences, J. T. Area Spt in the human planum temporale 
supports sensory-motor integration for speech processing. J. Neurophysiol. 101, 
2725-2732 (2009). 

7. Guenther, F. H. Cortical interactions underlying the production of speech sounds. 
J. Commun. Disord. 39, 350-365 (2006). 

8. Wise, R. J. S. etal. Separate neural subsystems within ‘Wernicke’s area’. Brain 124, 
83-95 (2001). 

19. Caramazza, A., Basili, A. G., Koller, J. J. & Berndt, R. S. An investigation of repetition 

and language processing in a case of conduction aphasia. Brain Lang. 14, 235-271 
(1981). 


WOON D 


RI studies published in 


98 | NATURE | VOL 507 | 6 MARCH 2014 


20. Crone, N.E., Sinai, A. & Korzeniewska, A. Event-related dynamics of brain 
oscillations. Prog. Brain Res. 159, 275-295 (2006). 

21. Markowitz, D.A., Wong, Y. T., Gray, C. M. & Pesaran, B. Optimizing the decoding of 
movement goals from local field potentials in macaque cortex. J. Neurosci. 31, 
18412-18422 (2011). 

22. Zhang, M. & Barash, S. Neuronal switching of sensorimotor transformations for 
antisaccades. Nature 408, 971-975 (2000). 

23. Gail, A. & Andersen, R. Neural dynamics in monkey parietal reach region reflect 
context-specific sensorimotor transformations. J. Neurosci. 26, 9376-9384 
(2006). 

24. Chang, E. F., Niziolek, C.A, Knight, R. T., Nagarajan, S. S. & Houde, J. F. Human 
cortical sensorimotor network underlying feedback control of vocal pitch. Proc. 
Nat! Acad. Sci. USA 110, 2653-2658 (2013). 

25. Oller, D.K., Eilers, R.E. & Oiler, D. K. The role of audition in infant babbling the role of 
audition in infant babbling. Child Dev. 59, 441-449 (1988). 

26. Agnew, Z. K., McGettigan, C., Banks, B. & Scott, S. K. Articulatory 
movements modulate auditory responses to speech. Neuroimage 73, 191-199 
(2013). 

27. Goodglass, H., Kaplan, E. & Barresi, B. Assessment of Aphasia and Related Disorders 
(Lippincott Williams & Wilkins, 2000). 

28. Damasio, H. & Damasio, A. R. The anatomical basis of conduction aphasia. Brain 
103, 337-350 (1980). 

29. Benson, D. F. et al. Conduction aphasia: a clinicopathic study. Arch. Neurol. 28, 
339-346 (1973). 

30. Murphy, K. etal. Cerebral areas associated with motor control of speech in humans 
cerebral areas associated with motor control of speech in humans. J. Appi. Physiol. 
83, 1438-1447 (1997). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We would like to thank A. Weiss, J. MacArthur and L. Frank for 
developing the data acquisition hardware and software; O. Felsovalyi, E. Londen, 

P. Purushothaman, L. Melloni, C. Boomhaur and A. Trongnetrpunya for technical 
assistance; and D. Poeppel and C. Brody for comments on the manuscript. This work 
was supported, in part, by RO3-DCO10475 from the NIDCD, a Career Award in the 
Biomedical Sciences from the Burroughs Wellcome Fund (B.P.), a Watson Investigator 
Program Award from NYSTAR (B.P.), a McKnight Scholar Award (B.P.) and a Sloan 
Research Fellowship (B.P.). 


Author Contributions G.B.C. designed the experiment, performed the research, 
analysed the data and wrote the manuscript. T.T. and O.D. performed the research and 
wrote the manuscript. C.C. and W.D. performed the research. B.P. designed the 
experiment, performed the research, analysed the data and wrote the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to B.P. (bijan@nyu.edu). 


©2014 Macmillan Publishers Limited. All rights reserved 


METHODS 


Participants. Electrocorticographic (ECoG) recordings were obtained from 16 
patients (6 males, 10 females; see Supplementary Table 1) with pharmacologically 
resistant epilepsy undergoing clinically motivated subdural electrode recordings at 
the New York University School of Medicine Comprehensive Epilepsy Center. 
Informed Consent was obtained from each patient in accordance with the Institutional 
Review Board at the New York University Langone Medical Center. Patient selec- 
tion for the present study followed strict criteria: first, cognitive and language abilities 
in the average range or above, including language and reading ability, as indicated 
by formal neuropsychological testing (see Supplementary Table 1); and second, 
normal language organization as indicated by cortical stimulation mapping, when 
available. In addition, only electrode contacts outside the seizure onset zone and 
with normal interictal activity were included in the analysis. 

Behavioural tasks and recordings. All participants performed three behavioural 
tasks: listen—speak, listen-mime and listen (Fig. 1a). Behavioural tasks were per- 
formed while participants reclined in a hospital bed. Tasks were controlled by a 
computer placed on the service tray on the bed running the Presentation program 
(NeuroBehavioural Systems). Behavioural audio recordings were either synchro- 
nized with the neural recordings at 10 kHz (see below) or recorded on the com- 
puter and referenced to the go cue. For a subset of subjects, a video camera with 
built-in microphone (Sony) was positioned to monitor subject orofacial move- 
ments and utterances. Video was streamed to disk (Adobe Premier Pro. (video at 
29.95 frames per s, and audio at 44.1 kHz)). Audio-visual and neural signals were 
synchronized video-frame-by-video-frame using an Analogue-to-Digital Video 
Converter (Canopus). 

Listen-speak, listen-mime* and listen tasks were randomly interleaved on a 
trial-by-trial basis with at least 4 s between trials. Each trial began with a visual cue 
presented, followed by the auditory consonant-vowel-consonant (CVC) token 
500 ms later. We used CVC words composed of the same consonants, ‘h’ and ‘t, 
and different vowels (hat, hit, heat, hoot, het, hut, hot). These spoken syllables span 
the vowel space and differ in their auditory and articulatory content. Subjects had 
to either listen passively (listen), repeat the syllable after a cue (listen—speak) or 
mime the syllable after a different cue (listen-mime, produce the appropriate 
mouth movements but with no vocal cord vibration; see Supplementary Fig. 10). 
The temporal delay between the auditory cue and the movement cue was 2 s. We 
obtained between 49 and 166 trials per condition (within subject) and between 175 
and 334 total trials per subject. 

For the tone-move task (see Supplementary Fig. 6), after the listen cue was 
delivered, a 500-ms, 1,000-Hz sinusoidal tone (with 100-ms on and off ramps) was 
presented. After a short, 2-s delay another visual cue was presented (move) 
instructing the subject to move their articulators (tongue, lips and jaw). For one 
subject, these trials were randomly interleaved within blocks of the listen-speak, 
listen—mime and listen tasks (see above). 

For the listen-speak transformation task, four subjects (see Supplementary 

Figs 7 and 8) were first presented with one of two visual cues: “Match Listen’ or 
“Mismatch Listen’. After a delay, subjects heard one of two non-words: ‘kig’ (/kIg/) 
or ‘pob’ (/pab/). These non-words were chosen to differ maximally on their articu- 
lator dimensions: ‘kig’ contains a velar (back) voiceless stop consonant, followed 
by a high front vowel and finally a velar voiced stop consonant, and ‘pob’ contains 
a bilabial (front) voiceless stop consonant followed by a back low vowel and then a 
bilabial front voiced stop consonant. The tongue movement therefore goes back to 
front to back for ‘kig’ and front to back to front for ‘pob’. The reason for choosing 
maximally different articulations was that larger articulator differences might lead 
to larger neural activity differences. After a short delay (randomized between 1.5 
and 2 s), another visual cue was presented (‘Speak’) to which subjects were to 
respond by saying the match non-word they had heard if they had seen the initial 
match cue, or say the mismatch non-word if they had seen the mismatch cue. Each 
non-word within each condition was presented between 63 and 78 times per 
subject, with total trials ranging from 255 to 309 per subject. This control was 
carried out in separate blocks trials that alternated with blocks of the main listen- 
speak, listen-mime and listen tasks. 
Surface reconstruction and electrode localization. To localize electrode record- 
ing sites, pre-surgical and post-surgical T1-weighted magnetic resonance imaging 
(MRI) scans were obtained for each patient and co-registered with each other’’. 
The co-registered images were then normalized to an MNI-152 template and 
electrode locations were then extracted in MNI (Montreal Neurological Institute) 
space (projected to the surface) using the co-registered image, followed by skull 
stripping’. A three-dimensional reconstruction of each patient’s brain was com- 
puted using FreeSurfer (Fig. 2; $2, S3, $4, S5, S6, S7, S8 and S10 (ref. 33). For 
Supplementary Table 2, Talairach coordinates were converted from MNI space 
using the EEG/MRI toolbox in Matlab (http://sourceforge.net/projects/eeg/, GNU 
General Public License). 
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Neural recordings and preprocessing. EEG data were recorded from intracra- 
nially implanted subdural electrodes (AdTech Medical Instrument Corp.) in 
patients undergoing elective monitoring of pharmacologically intractable seizures. 
Electrode placement was based entirely on clinical grounds for identification of 
seizure foci and eloquent cortex during stimulation mapping, and included grid 
(8 X 8 contacts), depth (1 X 8 contacts) and strip (1 X 4 to 1 X 12 contacts) 
electrode arrays with 10-mm inter-electrode spacing centre-to-centre. Subdural 
stainless steel recording grid and strip contacts were 4 mm in diameter; conse- 
quently the distance between contacts was 6 mm and they had an exposed 2.3-mm 
diameter recording contact. 

For 7 of the 16 subjects, neural signals from up to 256 channels were amplified 

(X10, INA121 Burr-Brown instrumentation amplifier), bandpass filtered between 
0.1-4,000 Hz and digitized at 10 kHz (NSpike, Harvard Instrumentation Laboratories) 
before being continuously streamed to disk for off-line analysis (custom C and 
Matlab code). The front-end amplifier system was powered by sealed lead acid 
batteries (Powersonic) and optically isolated from the subject. After acquisition, 
neuronal recordings were further low-pass filtered at 800 Hz and down-sampled 
offline to 2,000 Hz for all subsequent analysis. For the remaining 9 subjects, neural 
signals from up to 128 channels were recorded on a Nicolet One EEG system, 
bandpass-filtered between 0.5-250 Hz and digitized at 512 Hz. In some recordings, 
modest electrical noise was removed using line-filters centred on 60, 120 and 180 
Hz (ref. 34). 
Data Analysis. For activation analysis, time-frequency decomposition was per- 
formed using a multi-taper spectral analysis**. The power spectrum was calculated 
during a 500-ms analysis window with + 5 Hz frequency smoothing stepped 50 ms 
between estimates. Single trials were removed from the analysis if the raw voltage 
exceeded eight standard deviations from the across trial pool, and noisy channels 
were removed from the analysis by visual inspection or if they did not contain at 
least 60% of the total trials after the standard deviation threshold removal. 

Sensory—motor transformations were defined as activity in the gamma range 
(70-90 Hz) that followed the auditory stimulus as well as the production cue 
during both listen-speak and listen-mime (Fig. 1b). As the example responses 
illustrate, some electrodes showed consistent increases in activity in the high 
gamma band as high as 300 Hz. As the frequency extent varied across subjects, 
we chose to focus on the 70-90-Hz frequency range as this band showed the 
greatest activation consistently across all subjects. Similar results were obtained 
when a broader frequency range extending up to 150 Hz was analysed. Although 
the listen-mime condition does involve altering the motor plan (no vocal cord 
vibration), sensory-motor activations were based on the conjunction of activity in 
both the listen—speak and the listen-mime conditions. Any neural activity that was 
specific to the listen-mime condition and not present in ‘normal’ speaking con- 
ditions was therefore excluded (see Supplementary Fig. 10). 

Responses were divided into three types. The first response type, auditory 
(AUD), was defined as containing a response that was seen within 250-750 ms 
following the onset of the auditory stimulus in all three conditions (Fig. 1b, top). 
The second response type, production (PROD), was characterized as containing a 
response occurring between 500-1,000 ms after the respond cue in the listen- 
speak and the listen-mime conditions (Fig. 1b, middle). The last response type, 
S-M, contained both post stimulus and a post response cue activation in both the 
listen-speak and the listen-mime conditions (Fig. 1b bottom). The baseline period 
was defined as the 500 ms preceding the auditory stimulus. 

In Fig. 1b, the experimental epoch was defined as —500 ms (pre) to 3,500 ms 
post auditory stimulus onset. In Fig. 2c the experimental epoch was defined as 
—500 ms (pre) to 4,000 ms post auditory stimulus onset. The additional 500 ms 
was included in Fig. 2c to compensate for slightly later production responses for 
that the subject. Power in each frequency band was normalized to the power in the 
baseline period by dividing by the power at each frequency. As the neural res- 
ponses had variable onset times but were on average quite long in duration, the 
times were chosen to sample adequately all the responses under investigation. 

To assess statistical significance, the average power across trials was taken in two 
time regions of interest for each trial within each condition. For the listen con- 
dition, the baseline values for each trial were shuffled with the post auditory values 
10,000 times to create a null distribution. For the listen-speak and the listen-mime 
conditions, both the post-auditory and the post-production values were shuffled 
10,000 times with the baseline values to create two null distributions. Initial sig- 
nificance was assessed using a permutation test by comparing the actual difference 
between the post auditory and post production values with the shuffled data 
differences**. To correct for multiple comparisons, for all subjects, all three con- 
ditions and both analysis epochs (listen (post auditory), listen—speak (post aud- 
itory and post production) and listen—mime (post auditory and post production)) 
were pooled together and a false discovery rate (FDR) analysis was performed with 
an alpha threshold set at 0.05 (ref. 36). 
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The population latency analysis was performed using the baseline-corrected 
high gamma power response profiles for each electrode within each response class 
(S-M, AUD and PROD). The high gamma neural responses were first bandpass 
filtered (70-90 Hz) and then averaged within conditions. The listen-speak and 
listen—mime conditions were averaged together. As the data were recorded using 
two different sampling rates, the data were resampled to a 500 Hz sampling rate. 
To test for latencies within a response class, the latencies following either the 
auditory onset or the go cue were compared against the activity in the listen 
condition following the go cue by computing a permuted distribution for each 
time point. The significance values at each time point were then corrected for 
multiple comparisons using a FDR set with an alpha of 0.05. The first time point 
that was followed by at least 20 consecutive significant time points (40 ms) was 
taken to be the latency of the neural response. This resulted in four significant 
latency values. In the auditory epoch, AUD electrodes had significant neural 
responses at 164 ms and S-M electrodes had significant responses at 158 ms. 
During the production epoch, PROD electrodes had significant responses starting 
at 302 ms, whereas S-M electrodes had significant responses starting at 248 ms. A 
similar analysis was carried out comparing the left S-M electrodes with the right 
S-M electrodes, which resulted in four more significant latency values: right hemi- 
sphere (auditory 156 ms), left hemisphere (auditory 182 ms), right hemisphere 
(production 272 ms) and left hemisphere (production 268 ms). A direct compar- 
ison between these latencies within each task epoch using FDR-corrected shuffle 
tests (see above) revealed no significant results. 

To assess whether or not during the auditory and production epochs, the S-M 
electrodes display significantly faster neural responses than the AUD and PROD 
electrodes, we repeated the permutation test, except instead of using the compar- 
ison of the task compared to the ‘Listen’ condition, we compared the S-M electro- 
des to the AUD electrodes in the auditory epoch and the S-M electrodes to the 
PROD electrodes in the production epoch. The results showed that whereas S-M 
and AUD electrodes did not differ in their latency values during the auditory 
epoch, S-M electrodes were significantly faster than PROD electrodes in the pro- 
duction epoch. 

To test for power differences of the high gamma response (70-90 Hz) across 
hemispheres, we performed FDR-corrected permutation tests. Data were analysed 
by averaging a 300-ms time window, sliding 50 ms between estimates. The data 
were baseline-corrected (average —500 ms (pre) to 0 ms pre-stimulus activity 
across conditions, within electrodes) and then log-transformed before analysis. 
For each condition (listen-speak, listen-mime and listen) and within each hemi- 
sphere (left and right), we computed the task epoch responses by computing the 
average of the high gamma response during the auditory epoch (0-1,000 ms post 
auditory onset) and during the production epoch (0-1,500 ms post production cue 
onset). We then performed a series of permutation tests where we permuted the 
neural response across condition and/or across hemisphere, correcting for mul- 
tiple comparisons using a FDR procedure. Only four tests produced significant 
results: listen-speak versus listen during the production epoch in each hemi- 
sphere, and listen—mime versus listen during the production epoch in each hemi- 
sphere. Furthermore, the neural responses within all conditions were not different 
across hemispheres (see Supplementary Fig. 11, P > 0.05, FDR corrected). 

To assess the significant delay activation for each electrode class, a permutation 
test was carried out using filtered data as listed above. A permutation test was 
performed for each electrode class in which the average high gamma neural 
activity of the delay period (1-2 s post auditory onset) was compared to that of 
the baseline period (— 1s to —0.5s pre auditory onset). Although PROD electrodes 
and AUD electrodes did not display elevated population neural activity (P = 0.64 
and 0.53, respectively), S-M electrodes had significantly higher elevated delay 
activity compared to baseline (P = 0.01; see Fig. 2d). 

Classification was performed using the single value decomposition (SVD) of the 
high gamma neural response (70-160-Hz, 300-ms sliding windows with an over- 
lap of 250 ms) in either the auditory epoch (0-1,000 ms post auditory onset, AUD 
electrodes) or the production epoch (0-1,500 ms post go cue, PROD electrodes) or 
both (S-M electrodes). A linear discriminant analysis (LDA) classification was 
performed using a leave-one-out validation method, in which the training set 
consisted of all the trials of the data set except the one being tested. Note that 
analysing the different task epochs separately for the S-M electrodes produced 
classifier results that were also significantly above chance (auditory epoch, 40.2% 
(77 (1) = 47, P=7 X 10), production epoch, 23.2% (x? (1) = 5.6, P = 0.02)). 


To create the cumulative curves, the number of electrodes inputted into the 
classifier was increased linearly. To control for the variability in trial numbers, the 
minimum number of trials common to all subjects and electrodes was used. One- 
hundred iterations for each number of cumulative electrodes were performed, in 
which the specific trials and the specific electrodes were varied randomly and the 
number of SVD components was equal to the number of electrodes inputted to the 
classifier for the AUD and S-M electrodes, whereas five components were used for 
the PROD electrodes due to a lower number of components present in the PROD- 
electrode data. 

Confusion matrix scores are simply the proportion of trails classified as the 
token on the horizontal axis (decoded) given that the actual trial came from the 
vertical axis (actual). Confusion matrices in Fig. 3a are shown for the largest number 
of cumulative electrodes in each electrode class. 

To analyse the listen-speak transformation task responses (Fig. 4), the same 
decomposition (SVD) of the neural signal (70-160 Hz) was used. Note that instead 
of a seven-way classifier, a four-way classifier was used. Confusion matrices (Fig. 4C) 
are shown for the largest number of cumulative electrodes in each electrode class 
(AUD = 10; PROD = 19; S-M = 8). For the S-M electrodes, each response epoch 
(auditory, Fig. 4Cc; production, Fig. 4Cd) was analysed separately. 

To measure the quality of each of the models (sensory, motor, sensory-motor or 
chance; Fig. 4d) we used the Kullback-Leibler divergence, which quantifies the 
amount of information lost in bits when Q (the model) is used to approximate P 
(the data): 


PU) 

Dxr(PI|Q= > 7, Plog, (32) 
where P is the classification percentage for each actual or decoded pair (see above) 
and Q is one of the four models: sensory, motor, sensory-motor and chance. The 
Kullback-Leibler divergence estimates the information distance between the pat- 
tern of classification errors predicted by each model, shown in Fig. 4B and the 
pattern of classification errors based on neural recordings, shown in Fig. 4C. 
Smaller Kullback-Leibler divergence reflects more information about classifica- 
tion errors and improved model fit. The sensory model (Fig. 4Ba) reflects clas- 
sification scores that track the auditory speech input such that in both the match 
and the mismatch cases, the same input will be confused with each other. 
Conversely, the motor model (Fig. 4Bb) reflects classification scores that track 
the production output so that the same outputs will be confused with one another. 
However, the sensory-motor model (Fig. 4Bc) will reflect both the input and 
output such that classifications for each of the conditions presented (kig—kig, 
pob—pob, kig—pob and pob—kig) will be classified correctly. Finally, the chance 
model will simply reflect chance performance in all cases (0.25). 

Classifier analysis of the cue data (‘Match Listen’ versus ‘Mismatch Listen’) in 
the listen-speak transformation task was analysed on the S-M electrodes for the 
subjects performing the task. The same linear classifier was used as above, but was 
performed during the cue period (0-1,000 ms post Cue) and was two-way (‘Match 
Listen’ versus ‘Mismatch Listen’ cues). The results demonstrated that the classifica- 
tion was not significant (mean classification = 52.3%, x” (1) = 0.08, P = 0.78). 
Furthermore, using the same two-way classifier between the match and mismatch 
condition during the auditory epoch was also not significant (mean classifica- 
tion = 56.4%, x? (1) = 0.72, P = 0.4). Taken together, this indicates that the sens- 
ory—motor transformations displayed by these electrodes cannot be due to a third 
population of neurons that code for the visual cue. 
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Ring chromosomes are structural aberrations commonly associated 
with birth defects, mental disabilities and growth retardation’”. 
Rings form after fusion of the long and short arms of a chromo- 
some, and are sometimes associated with large terminal deletions’. 
Owing to the severity of these large aberrations that can affect mul- 
tiple contiguous genes, no possible therapeutic strategies for ring 
chromosome disorders have been proposed. During cell division, 
ring chromosomes can exhibit unstable behaviour leading to con- 
tinuous production of aneuploid progeny with low viability and 
high cellular death rate*°. The overall consequences of this chromo- 
somal instability have been largely unexplored in experimental model 
systems. Here we generated human induced pluripotent stem cells 
(iPSCs)'*” from patient fibroblasts containing ring chromosomes 
with large deletions and found that reprogrammed cells lost the 
abnormal chromosome and duplicated the wild-type homologue 
through the compensatory uniparental disomy (UPD) mechanism. 
The karyotypically normal iPSCs with isodisomy for the corrected 
chromosome outgrew co-existing aneuploid populations, enabling 
rapid and efficient isolation of patient-derived iPSCs devoid of the 
original chromosomal aberration. Our results suggest a fundament- 
ally different function for cellular reprogramming as a means of 
‘chromosome therapy’ to reverse combined loss-of-function across 
many genes in cells with large-scale aberrations involving ring struc- 
tures. In addition, our work provides an experimentally tractable 
human cellular system for studying mechanisms of chromosomal 
number control, which is of critical relevance to human development 
and disease. 

We obtained fibroblasts from a Miller Dieker Syndrome (MDS) patient 
with ring chromosome 17, subsequently referred to as r(17). MDS is 
caused by heterozygous deletions of human band 17p13.3 (refs 14, 15) 
(Fig. 1a). This deletion alone leads to craniofacial dysmorphology, defec- 
tive neuronal migration, abnormal cortical layering and nearly absent 
cortical folding with devastating neurological consequences such as 
mental retardation and intractable epilepsy'*’®. However, in this case 
the 17p13.3 deletion was in a ring chromosome, and the patient had 
a typical MDS phenotype™. To separate the effects of r(17) from the 
17p13.3 deletion, we obtained fibroblasts from two additional MDS 
patients with similar deletions but without r(17) (Fig. 1b). 

Two critical genes deleted in MDS are PAFAH1B1] (encoding LIS1) 
and YWHAE (encoding 14-3-3¢) (Fig. 1a)’°. Consistent with this, MDS 
fibroblasts MDS1r(17), MDS2 and MDS3 (Fig. 1c) expressed reduced 
PAFAHI1B1 and YWHAE messenger RNA compared to control fibro- 
blasts (Fig. 1d, e). MDS1r(17) fibroblasts had a 46,XY,r1(17) karyotype 
in 95% of the cells (Figs 1i and 2a and Extended Data Fig. 1), with 
the remaining 5% exhibiting ring loss or secondary ring derivatives 


(Fig. 2b). 


To investigate the behaviour of ring chromosomes in actively pro- 
liferating cells, we generated iPSCs using non-integrating episomal 
vectors'’. All MDS iPSCs were morphologically indistinguishable from 
wild type (Fig. 1f) and expressed stem cell markers (Fig. 1g and Extended 
Data Fig. 2a—d). We confirmed that MDS iPSCs were free of exogenous 
factor integration (Extended Data Fig. 3a, b) and were functionally plu- 
ripotent, producing cell types of the three germ layers (Extended Data 
Figs 4 and 5). 

We then analysed six early-passage MDS11(17) iPSC clones for the 
presence of the ring (Fig. 1h) and, unexpectedly, found that four out of 
six clones did not have any detectable ring chromosomes (Fig. 1i), but 
grew well and had proper morphology (Extended Data Fig. 6a). In con- 
trast, two other clones with detectable ring chromosomes (Fig. 1i) differ- 
entiated or stopped growing upon subsequent passaging (Extended 
Data Fig. 6b, c). Analysis of chromosome composition revealed that 
stable clones had 46 chromosomes and no ring in 85-100% of cells, in 
contrast to <15% of cells in unstable clones (Fig. 1iand Extended Data 
Fig. 6d, e). These results suggested that r(17) was incompatible with 
reprogramming and/or stem cell maintenance using our methods. 

Further cytogenetic analysis of the first two MDS1r(17) iPSC clones 
demonstrated a normal 46,XY karyotype without r(17) (Fig. 2a—c). In 
addition, the deletion, which was readily detectible by Giemsa (G)- 
banding in MDS2 and MDS3 iPSCs, was not apparent in MDSIr(17) 
iPSCs (Fig. 2a and Extended Data Fig. 1). These findings could be 
explained by either clonal expansion of rare cells with a normal karyo- 
type from mosaic fibroblasts; or repair or replacement of the ring chro- 
mosome during or after reprogramming. We reasoned that presence of 
a small fraction of cells with the deleted r(17) in predominantly cor- 
rected iPSC clones would confirm their origin from an abnormal fibro- 
blast. To test this, we analysed interphase nuclei by fluorescence in situ 
hybridization (FISH) using a red probe to detect the sub-telomeric 
band 17p13.3 together with a green probe to detect the long-arm band 
17q21.32 (Fig. 2d, e). Consistent with karyotype data, =80% of cells 
in iPSC clones 1 and 2 demonstrated a normal signal pattern (2R2G) 
(Fig. 2f), suggesting that they have two intact copies of chromosome 
17. However, 10-20% of interphase iPSCs had a 1R2G signal pattern, 
indicative of r(17) with p13.3 deletion (Fig. 2f). These results revealed 
persistent mosaicism in MDS1r(17) iPSCs, confirming their origin 
from ring-containing fibroblasts and supporting the hypothesis that 
repair or replacement of the ring chromosome occurred during repro- 
gramming. Of note, despite several attempts (>120 cells analysed in 
multiple experiments), no evidence of r(17) was found in metaphase 
iPSCs, suggesting that in the pluripotent state such cells may be ter- 
minal and non-dividing. The expectation is that remaining interphase 
cells with r(17) would be gradually depleted. 
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Figure 1 | Reprogramming from fibroblasts with r(17) produces multiple polymerase chain reaction with reverse transcription (RT-qPCR) for 

iPSC clones that do not have the ring chromosome. a, Schematic of human PAFAHIB1 (d) and YWHAE (e) mRNA levels in fibroblasts (n = 2). f, Images 
chromosome 17 (chr 17), highlighting band 17p13.3, which is deleted in MDS. _ of WT, MDSir(17) and MDS2 iPSCs. g, Western blot analysis for pluripotency 
Encoded genes are listed on the right, and genes within the minimal critical markers in fibroblasts (Fib) and two iPSC clones. h, Examples of metaphase 
MDS deletion are shown in red. b, Schematic of chr 17 status in wild type(WT) __ spreads with or without the ring chromosome. i, Percentage of mitotic cells with 
and MDS cells. ¢, List of fibroblast lines used in this study. d, e, Quantitative or without the ring chromosome in fibroblasts or MDS1r(17) iPSC clones. 
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Figure 2 | Karyotypically normal cells predominate in early passage iPSC in d. f, Proportions of cells with various signal patterns (n = 200 each). 
clones derived from MDS1r(17) fibroblasts. a, Images of chr 17 pairs in g, h, qPCR for genomic PAFAH1B1 DNA in fibroblasts (g) and iPSCs (h). 
MDSir(17) fibroblasts and iPSC clones. The arrow is pointing to r(17) only AU, arbitrary units; n = 2. i, j, Western blot analysis for LIS1 and 14-3-3¢ in 
found in the fibroblasts. b, c, Proportion of MDS11(17) fibroblasts (b) andiPSC _ fibroblasts (i) (two samples per fibroblast line collected on different days) and 
clones 1 and 2 (c) with various karyotypes (n = 20 each). d, Approximate iPSCs (two clones per line) (j). 

position of FISH probes on chr 17. e, Signal patterns obtained with FISH probes 
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Cytogenetic and FISH results suggested that the MDS-associated 
deletion was rescued in MDS1r(17) iPSC clones 1 and 2. Consistent 
with that rescue, PAFAH1B1 genomic DNA levels were restored from 
50% of wild type in the original fibroblasts (Fig. 2g) to wild-type levels 
in iPSCs (Fig. 2h). Conversely, in MDS2 and MDS3 cells without r(17), 
PAFAHI1B1 genomic DNA levels were consistently reduced before and 
after reprogramming (Fig. 2g, h). Similar results demonstrating rescue 
were obtained by western blot analysis for LIS1 and 14-3-3¢ proteins 
(Fig. 2i, j). To investigate the status of the deleted locus before and after 
reprogramming in more detail, we performed single nucleotide poly- 
morphism (SNP) microarrays. Analysis of total copy number for SNPs 
on chromosome 17 showed a clear decrease between positions 1 and 
5,700,000 in MDSir(17) fibroblasts, which was not observed in the 
corresponding iPSCs (Fig. 3a, b). In contrast, large terminal deletions 
were apparent in iPSCs derived from MDS2 and MDS3 fibroblasts 
(Fig. 3c, d). Of note, the extent of the deletion in MDS3 cells was exactly 
the same as in MDS1r(17) fibroblasts, making it an ideal control for the 
effect of the ring. Therefore, 17p13.3 deletion is recapitulated in iPSCs, 
and does not severely compromise pluripotent stem cell growth, viabil- 
ity or function. In contrast, r(17) appears to compromise in vitro stem 
cell maintenance and proliferation, and iPSCs that lost or repaired the 
ring chromosome showed preferential survival. 

We proposed two possible mechanisms that could lead to loss of 
r(17) and rescue of the deleted locus. The first mechanism, called com- 
pensatory uniparental disomy (UPD), also referred to as monosomy 
correction, is known to occur during human development, and involves 
replacement of an abnormal chromosome with a copy of its normal 


a MDS1r(17) fibroblasts: del 1-5,700,000 c 


MDS2 clone 1 iPSCs: del 1—4,500,000 
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homologue’®. This is a two-step process that could proceed through 
loss of r(17) due to anaphase lag, followed by duplication of the wild 
type chromosome 17 by nondisjunction (Fig. 3e). The second mech- 
anism could occur through a ring opening owing to a double-strand 
break”, followed by homology-based repair or somatic recombination”*”’ 
with the wild-type homologue (Fig. 3e). In both cases selection is likely 
to favour cells with 46 normal chromosomes. However, in the first sce- 
nario we would expect both chromosomes 17 in iPSCs to be isodiso- 
mic; that is, completely homozygous, whereas in the second scenario 
we would expect partial loss of heterozygosity (LOH) at the repaired 
locus with the rest of chromosome 17 being heterozygous (Fig. 3e). To 
distinguish between these possibilities, we used data from SNP arrays 
to determine heterozygosity on chromosome 17 in fibroblasts and iPSCs. 
Consistent with compensatory UPD, the two homologues of chromo- 
some 17 were entirely homozygous in both iPSC clones derived from 
MDS11(17) cells, whereas the original fibroblasts as well as MDS2 and 
MDS3 iPSCs exhibited 27% heterozygosity (Fig. 3f). This was a specific 
effect for the corrected chromosome 17 (Fig. 3g). 

Non-random gain of a third chromosome 17 occurs frequently in 
long-term culture of human pluripotent cells owing to a proliferative 
growth advantage”. To test whether chromosomes not typically gained 
or implicated in advantageous stem cell growth can replace the corre- 
sponding ring chromosome in iPSCs, we reprogrammed fibroblasts 
from two additional cases involving ring chromosome 13, subsequently 
referred to as r(13), with large deletions (Fig. 4a, f) associated with mul- 
tiple congenital anomalies. We confirmed the presence of r(13) in approx- 
imately 80-100% of the fibroblasts, with the remaining cells exhibiting 


Figure 3 | Rescue of MDS-associated deletion in 
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Figure 4 | Derivation of iPSC clones with a normal karyotype from 
fibroblasts with r(13). a, Images of chr 13 pairs in fibroblasts and repaired 
iPSCs from two individuals with r(13). The arrows are pointing to r(13) only 
found in the fibroblasts. b-e, Proportions of fibroblasts and iPSCs with various 
karyotypes (n = 20 each). f, Schematic of chr 13 showing approximate 


45 chromosomes minus r(13) or non-clonal aberrations involving the 
ring structure (Fig. 4b, d and Extended Data Fig. 7a, d). Initial examina- 
tion of iPSCs at passage six revealed 46,XY,r(13) karyotypes, although a 
fraction of cells had lost the ring chromosome (Fig. 4c, e). However, six 
out of nine clones that were examined after passage eight had a normal 
karyotype (Fig. 4a, c, e and Extended Data Fig. 7). In addition, GM00285 
iPSC clone 4 transitioned from predominantly 46,XY,r(13) at passage 
six to 46,XY at passage twelve (Fig. 4c). These results demonstrate 
dynamic mosaicism between clonally related cells, leading to preferen- 
tial survival of karyotypically normal iPSCs in a matter of six passages, 
which did not occur in the fibroblasts (Fig. 4b, d). In agreement, FISH 
studies revealed normal signal patterns in the corrected iPSC clones 
(Fig. 4f-h). Finally, SNP microarray analysis confirmed rescue of the 
deleted regions (Extended Data Fig. 8a—e) and showed that corrected 
iPSCs are completely homozygous for chromosome 13 (Extended Data 
Fig. 8f, g). The repaired iPSCs expressed stem cell markers (Extended 
Data Fig. 2e-j), differentiated into three germ layers (Extended Data 
Fig. 9), and were free of exogenous factor integration (Extended Data 
Fig. 3c, d). All of the established iPSC lines in this study were confirmed 
to be identical to their original fibroblasts by DNA fingerprinting 
(Extended Data Table 1). 

Finally, to further rule out the possibility that repaired iPSC clones 
originated from rare 46,XY fibroblasts, we measured the reprogram- 
ming rates for the lines used in this study. If there is a bias against 
reprogramming from cells with ring chromosomes, then we would 
expect reduced reprogramming efficiency from ring-containing fibro- 
blast populations. However, we found no significant difference between 
wild-type and MDS cells, while reprogramming efficiency from r(13) 
fibroblasts was increased compared to wild type (Fig. 4i). Therefore, 
r(17) orr(13) chromosomes do not interfere with episomal reprogram- 
ming. More likely, reprogramming increases cell divisions, increasing 
the probability that random nondisjunction compensates for the loss 
of the corresponding ring. Nondisjunction leading to chromosomal 
trisomy is the most commonly reported karyotypic change in cultured 
human ES cells and iPSCs”’, because mitotic checkpoints that ensure 
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breakpoints and position of FISH probes used in g. g, Examples of signal 
patterns obtained with FISH probes. h, Proportions of cells with various signal 
patterns (n = 200 each). i, Number of TRA-1-60-positive colonies on day 25 
per 10,000 slectraporatcd cells. Results are mean + s.e.m. (n = 4). 


proper alignment and disentanglement of sister chromatids before 
separation are less efficient than in differentiated cells****. In this 
case, isodisomic cells have a growth advantage over cells with ring 
chromosomes including large deletions; however, they are not exactly 
the same as wild type. UPD could lead to undesirable clinical conse- 
quences through homozygosity for a recessive mutation or disruption 
of imprinting'*. Thus, expression of imprinted genes and presence of 
recessive mutations would have to be carefully monitored and/or cor- 
rected for any therapeutic applications. 

Compensatory UPD and dynamic mosaicism occur in actively prolif- 
erating cells in vivo, including cases involving ring chromosome 21, in which 
patients’ lymphocytes transitioned from predominantly 46,XX,r(21) to 
46,XX (with UPD21) several years later”*°. These observations further 
suggest that taking advantage of this intrinsic property of iPSCs may 
be a feasible approach to correct combined loss-of-function and struc- 
tural aberrations associated with additional ring chromosomes. Given 
that repair of large deletions involving multiple genes is not possible 
with current genome editing techniques, our results may have broad 
implications for regenerative medicine and advance the newly emerging 
concept of chromosome therapy’*. More broadly, our findings under- 
score the importance of studying chromosome repair in stem cells and 
establish a novel platform for investigating the mechanisms that regu- 
late chromosome number during early human development. 


METHODS SUMMARY 


The following primary human cells were used in this study: MDS1r(17) (GM06047), 
MDS2 (GM06097), MDS3 (GM09208), GM00285 and GM05563 (Coriell Institute 
for Medical Research); BJ and Detroit 551 (ATCC); HDF-1323 from a 48-year-old 
caucasian female (Cell Applications; 106-05a); HDF-WTc from a 30-year-old Japa- 
nese male, whose skin samples were collected at Gladstone. Human iPSCs were 
generated with the episomal Y4 plasmid mixture as described’” with some mod- 
ifications (available in the online Methods). Established hiPSC lines were cultured 
in mTeSR1 medium (Stem Cell Technologies) on dishes coated with Matrigel 
(BD) or Synthemax (Corning). Chromosomal analyses were performed on 20 
G-banded metaphase cells from each sample and FISH analyses were performed 
on two hundred interphase nuclei from each sample at Cell Line Genetics or the 
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University of California San Francisco cytogenetics laboratory. SNP genotyping to 
determine CNV presence and UPD status was performed using the Affymetrix 
Axiom EURarray platform by the Genomics Core Facility (GCF) at the University 
of California San Francisco for MDS samples and using the Agilent SurePrint G3 
Human CGH + SNP 4 X 180K array platform by Cell Line Genetics for r(13) samples. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Citrullination regulates pluripotency and histone H1 


binding to chromatin 
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Citrullination is the post-translational conversion of an arginine 
residue within a protein to the non-coded amino acid citrulline’. 
This modification leads to the loss of a positive charge and reduc- 
tion in hydrogen-bonding ability. It is carried out by a small family 
of tissue-specific vertebrate enzymes called peptidylarginine deimi- 
nases (PADIs) and is associated with the development of diverse 
pathological states such as autoimmunity, cancer, neurodegenera- 
tive disorders, prion diseases and thrombosis”*. Nevertheless, the 
physiological functions of citrullination remain ill-defined, although 
citrullination of core histones has been linked to transcriptional 
regulation and the DNA damage response**. PADIA4 (also called 
PAD4 or PADV), the only PADI with a nuclear localization signal’, 
was previously shown to act in myeloid cells where it mediates pro- 
found chromatin decondensation during the innate immune res- 
ponse to infection’®. Here we show that the expression and enzymatic 
activity of Padi4 are also induced under conditions of ground-state 
pluripotency and during reprogramming in mouse. Padi4 is part of 
the pluripotency transcriptional network, binding to regulatory 
elements of key stem-cell genes and activating their expression. Its 
inhibition lowers the percentage of pluripotent cells in the early 
mouse embryo and significantly reduces reprogramming efficiency. 
Using an unbiased proteomic approach we identify linker histone 
H1 variants, which are involved in the generation of compact chro- 
matin”, as novel PADI4 substrates. Citrullination of a single argi- 
nine residue within the DNA-binding site of H1 results in its dis- 
placement from chromatin and global chromatin decondensation. 
Together, these results uncover a role for citrullination in the regu- 
lation of pluripotency and provide new mechanistic insights into 
how citrullination regulates chromatin compaction. 

Pluripotent cells have the capacity to self-renew and differentiate 
into all somatic and germ-cell lineages and, hence, possess therapeutic 
potential for a multitude of medical conditions. Their generation by 
reprogramming of differentiated somatic cells has been achieved by 
nuclear transfer, cell fusion and transduction of transcription factors, 
such as Oct4 (also called Pou5f1), Sox2, Klf4 and c-Myc”. Pluripotent 
cells have a distinctly open chromatin structure that is essential for 
unrestricted developmental potential’*"*, and reprogramming involves 
an almost complete epigenetic resetting of somatic cells’. The ability of 
PADI4-mediated histone citrullination to induce chromatin decon- 
densation in neutrophils’? prompted us to ask whether it can have a 
role in pluripotency, where chromatin decondensation is also neces- 
sary. To investigate this we first assessed the expression of Padi4 in the 
mouse embryonic stem-cell line ES Oct4-GIP (ES), the mouse neural 


stem-cell line NSO4G and in induced pluripotent stem (iPS) cells 
derived from NSO4G (see Methods). Padi4 is expressed in pluripotent 
ES and iPS cells but not multipotent neural stem cells (Fig. 1a). Culture 
of ES cells in 2i/LIF medium establishes a ground state of pluripo- 
tency'*. This leads to the downregulation of lineage-specific markers 
and the upregulation of pluripotency factors, as well as rapid induction 
of Padi4 expression (Fig. 1b). The pattern of Padi4 expression follows 
closely that of Nanog, an essential transcription factor for the transition 
to ground-state pluripotency”® (Fig. 1a, b). Whereas other Padi genes 
are expressed in pluripotent cells, Padi4 is the only one for which 
expression clearly associates with naive pluripotency (Extended Data 
Fig. 1a, b). Citrullination of histone H3 (H3Cit), a modification shown 
previously to be carried out specifically by PADI4 (ref. 17), is detectable 
in ES and iPS cells (Extended Data Fig. 1c), indicating that Padi4 is also 
enzymatically active. H3 and global citrullination are undetectable in 
NSO4G cells (Extended Data Fig. Ic, d). 

To determine the kinetics of Padi4 activation during the establish- 
ment of pluripotency, we examined RNA and protein samples col- 
lected daily during the course of reprogramming of NSO4G into iPS 
cells'®. Padi4 is rapidly induced in NSO4G after transduction of repro- 
gramming factors but only becomes active to citrullinate H3 after 
introduction of 2i/LIF, closely following the onset of Nanog expression 
(Fig. 1c). These observations strongly suggested that Padi4 activity is 
associated with ground-state pluripotency and prompted us to exam- 
ine whether Padi4 is part of the pluripotency transcriptional network. 

First, we asked whether the reprogramming factors regulate Padi4 
expression, using the ZHBTc4.1 and 2TS22C cell lines where Oct4 and 
Sox2, respectively, can be deleted acutely in response to doxycycline 
treatment (see Methods). Deletion of Oct4, but not Sox2, led to a 
decrease in Padi4 messenger RNA levels (Extended Data Fig. le). 
Furthermore, whereas Oct4 and K1f4 occupy the Padi4 promoter in 
ES, but not NSO4G, cells, Sox2 is bound in both cell types (Extended 
Data Fig. 1f). To understand the effects of Padi4 on transcriptional 
regulation in pluripotent cells, we analysed the transcriptome of ES 
cells upon overexpression of human PADI4 and inhibition of endo- 
genous Padi4. Several key pluripotency genes are upregulated in res- 
ponse to PADI4 overexpression (Fig. 1d, Extended Data Fig. 2a and 
Supplementary Table 2), including KIf2, Tcll, Tcfap2c and Kit. Tell 
was previously identified as the only regulator of self-renewal upregu- 
lated in ground-state pluripotency’’, and overexpression of Tcll or 
Tcfap2c positively influences reprogramming'®. Gene Ontology (GO) 
analysis of this data set indicates an enrichment of genes involved in stem- 
cell development and maintenance (Fig. le). In addition, knockdown 
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Figure 1 | Padi4 expression and activity are features of pluripotent cells. 

a, b, RT-PCR for Padi4 and Nanog expression in ES, NSO4G and iPS cells 
(a), and in ES cells on culture in 2i/LIF for one passage (b). Pow5f1, Olig2 and 
Paxé6 are presented as controls. Expression normalized to ubiquitin (UbC). 
Error bars indicate standard error of the mean of three biological replicates. 
c, RT-PCR for Padi4 and Nanog expression and H3Cit immunoblot during 
the course of reprogramming (see also Extended Data Fig. 5a). Loading control: 
total histone H3. Representative of four experiments. d, Heat map of the genes 
regulated on human PADI4 (hPADI4) overexpression in mouse ES cells, as 
determined by microarray analysis. Displayed values are normalized log 
intensities, minus the mean expression of the gene across the four samples. 
Hierarchical clustering based on correlation. e, Gene Ontology for Biological 
Process (GOBP) analysis of the above microarray data set. P value is corrected 


of Padi4 in mouse ES cells leads to decreased expression of Tcl1 and 
Nanog, which is rescued by exogenous expression of RNA interference 
(RNAi)-resistant human PADI4 (Fig. 1fand Extended Data Fig. 2b, c). 
These genes are under the control of PADI4 enzymatic activity, as 
treatment with the chemical inhibitor Cl-amidine, which disrupts 
citrullination by PADI4 (ref. 19), downregulates their expression 
(Fig. 1g). Chromatin immunoprecipitation (ChIP) analysis indicated 
that H3Cit is present on regulatory regions of Tcll and Nanog in ES 
and iPS cells, but not NSO4G (Fig. 1h and Extended Data Fig. 3a). 
Accordingly, exogenously expressed human PADI4 localizes to and is 
enzymatically active on these regions, as well as regulatory regions of 
Kif2 and Kit in ES cells (Extended Data Fig. 3b, c). In contrast to 
PADI4 overexpression, treatment of ES cells with Cl-amidine led to 
upregulation of differentiation markers such as Prickle1, Ephal and 
Wnt8a and downregulation of pluripotency markers such as Kif5 
(Extended Data Fig. 4a, b and Supplementary Table 3), in addition 
to Nanog and Tcl1 (Fig. 1g). GO analysis of this data set indicated 
enrichment in genes involved in cell differentiation (Extended Data 
Fig. 4c). Pou5fl, Kif4, Sox2 and c-Myc were not affected by PADI4 
modulation (Extended Data Fig. 2a and Supplementary Tables 2 and 
3). Cumulatively, the above results place Padi4 within the pluripotency 


for multiple testing using Benjamini and Hochberg false discovery rate (FDR). 
f, g, RT-PCR for Tcll and Nanog expression in mouse ES cells after transient 
knockdown with Padi4 or control (ctrl) shRNAs, and overexpression of 
human PADI4 or control vector (pPB ctrl) (f), and after treatment with 200 1M 
Cl-amidine (g). Expression normalized to UbC. Error bars indicate standard 
error of the mean of three biological replicates. h, ChIP-qgPCR for H3Cit on 
regulatory regions of Tcll and Nanog in mouse ES, NSO4G and iPS cells. Error 
bars indicate standard deviation of three technical qPCR replicates. Data are 
representative of three experiments. Asterisks denote difference with ES cells 
(a) or ES media (b), control (f, g) and between samples (f); NS, not significant; 
*P = 0.05, **P = 0.01, ***P = 0.001, ****P = 0.0001, by ANOVA (a, f) or 
t-test (b, g). 


transcriptional network, indicating that it acts downstream of some of 
the cardinal reprogramming factors to regulate a specific subset of 
pluripotency genes. 

Prompted by the above observations, we investigated whether Padi4 
is necessary for pluripotency, as assessed during reprogramming (Extended 
Data Fig. 5a) and in the early stages of embryo development. NSO4G 
cells express a green fluorescent protein (GFP) reporter under the control 
of the Oct4 regulatory sequences, which is activated on acquisition of 
pluripotency’®, allowing us to trace reprogrammed cells. Knockdown 
of Padi4 in NSO4G-derived pre-iPS cells impaired the ability of the 
cells to establish H3Cit upon switch to 2i/LIF medium and led to a 
clear reduction in reprogramming (Fig. 2a, Extended Data Fig. 5b-e and 
Supplementary Video 1). Consistent with this finding, levels of Tcll and 
Nanog were not elevated on reprogramming to the same extent as in 
control cells (Fig. 2b and Extended Data Fig. 5f). Cl-amidine treatment 
led to a marked reduction of reprogramming efficiency and H3Cit (Fig. 2c 
and Extended Data Fig. 5g-i), indicating that the catalytic activity of 
Padi4 is important for the induction of pluripotency. 

Padi4 expression and H3Cit are detected in the early embryo” and 
Padi4-null mice are born in lower numbers than would be expected by 
Mendelian inheritance”, indicating that Padi4 loss affects embryonic 
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Figure 2 | Citrullination and Padi4 regulate 
pluripotency during reprogramming and early 
embryo development. a, Flow cytometry analysis 
and phase contrast/fluorescence images for the 
assessment of Oct4—GFP reporter expression after 
reprogramming of pre-iPS cells stably expressing 
Padi4 and control shRNAs. Data are representative 
of four independent experiments. See 
Supplementary Video 1 for time-lapse video. 

b, qRT-PCR for expression of Tcl1, Nanog and 
Padi4 at the end of the above reprogramming assay. 
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development. To assess the role of Padi4 in early development, we cul- 
tured mouse embryos in Cl-amidine-containing medium from the 2-cell 
stage and throughout pre-implantation development (see Methods 
and Extended Data Fig. 6a—c). Using 200 UM Cl-amidine resulted in 
a complete developmental arrest of the embryos at the 8-cell stage 
(Extended Data Fig. 6a). We therefore used the maximum dose of 
Cl-amidine that reduced H3Cit (Extended Data Fig. 6b, c) but did 
not induce arrest (10 1M). This led to a reduced percentage of plur- 
ipotent Nanog-positive epiblast cells and an increased percentage of 
differentiated trophectoderm cells at the blastocyst stage (Fig. 2d, e and 
Extended Data Fig. 6d, e). Time-course analyses of the cleavage patterns 
and cell-fate decisions in early embryos showed that Cl-amidine increased 
the number of symmetric cell divisions at the expense of asymmetrical 
divisions at the 8-16- and 16-32-cell transitions (Fig. 2f, g). This resulted 
in 16-cell-stage embryos with fewer inner cells (destined for pluripo- 
tency) and greater numbers of outer cells (destined for differentiation 
into trophectoderm; reviewed in ref. 23) (Extended Data Fig. 6f). 
Treatment with another Padi4 inhibitor, Thr-Asp-F-amidine (TDFA)™, 
but not the HDAC inhibitor trichostatin A (TSA), had similar effects 
(Extended Data Figs 7 and 8). These results indicate that Padi4 activity 
also promotes the maintenance of pluripotent cells in the early mouse 
embryo. 

To elucidate the molecular mechanisms by which Padi4 regulates 
pluripotency, we aimed to identify PADI4 substrates in the chromatin 
fraction of mouse ES cells using stable isotope labelling of amino acids 
in cell culture (SILAC) (Fig. 3a and Extended Data Fig. 9a, b). Among 
the identified PADI4 substrates were Atrx, Dnmt3b, Trim28 and var- 
iants of linker histone H1 (Fig. 3b-e, Extended Data Figs 9c-f and 10a, b, 
and Supplementary Table 4), all of which can have an impact on plu- 
ripotency. Histone H1 stabilizes the nucleosome and facilitates chro- 
matin condensation, a state that is less permissive to processes that 
require access to the DNA, such as transcription”. The identified citrul- 
linated H1 peptides correspond to, and are common between, variants 
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Error bars indicate standard error of the mean of 
triplicate samples. c, Flow cytometry analysis and 
phase contrast/fluorescence images for the 
assessment of Oct4—GFP reporter expression after 
reprogramming assay of pre-iPS cells treated with 
200 LM Cl-amidine. Representative of three 
independent experiments. d, E4.5 blastocysts from 
2-cell-stage embryos treated with 10 1M Cl- 
amidine. Sox17 (primitive endoderm marker, red), 
Cdx2 (trophectoderm marker, green), Nanog 
(epiblast marker, white) and Hoechst 33342 (blue) 
images are shown. e, Distribution of inner-cell 
mass (ICM) versus trophectoderm (TE) cells in 
E3.5 blastocyst treated as above. f, g, Time-lapse 
analysis of embryos in 10 1M Cl-amidine from 
2-cell stage. Number of symmetric versus 
asymmetric divisions at the 8-16-cell transition 
(f) and type of divisions at the 16-32-cell transition 
(g) is shown. Error bars indicate standard error of 
the mean. Statistical significance was determined 
by unpaired t-test (b), or Mann-Whitney U-test on 
non-normal distribution (e-g). Asterisks denote 
difference with control; *P <= 0.05, **P <= 0.01, 
***P =< 0.001, ****P = 0.0001. 
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H1.2, H1.3 and H1.4 (Fig. 3d, e), whereas an additional peptide corre- 
sponds to the same residue in H1.5 (Extended Data Fig. 10a, b). In ES 
cells, H1.2, H1.3 and H1.4 are required for chromatin compaction”’, 
whereas their depletion leads to increased expression of pluripotency 
genes such as Nanog and stalls them in a self-renewal state with impaired 
differentiation capability*®. Notably, H1 is more loosely bound to chro- 
matin in ES cells than in differentiated cells'* and its genomic local- 
ization in cancer cells was shown to anti-correlate with that of PADI4 
(ref. 7). Mass spectrometric analysis accounted for all arginine residues 
within H1 but indicated that Arg54 is the only site citrullinated by 
PADI4 (Fig. 3e and Extended Data Fig. 9c). Indeed, we found that 
although H1.2 is citrullinated in ES cells (by endogenous Padi4, and 
significantly increased on PADI4 overexpression), it is refractory to 
modification when Arg 54 is mutated (Fig. 3f). Similar results were 
obtained in in vitro citrullination assays (Extended Data Fig. 10c). H1 
Arg 54 lies within the globular domain of H1 (Extended Data Fig. 10d), 
which is highly conserved among the linker histone family and is 
necessary for interaction with nucleosomal DNA”’*. To test whether 
H1 Arg 54 is necessary for binding of H1 to nucleosomes, we mutated 
and assessed it in nucleosome-binding assays. Figure 3g shows that an 
R54A mutant, which mimics the charge change that accompanies citrul- 
lination, is impaired for nucleosome binding. An R54K mutant, which 
retains the positive charge, is impaired to a lesser extent (Fig. 3g), indi- 
cating that H1R54 is important for electrostatic interactions between 
H1.2 and the nucleosome. The above results open up the possibility 
that PADI4 may affect chromatin compaction in pluripotent cells. To 
test this hypothesis, we first assessed whether citrullination by ectopic 
PADIA4 can lead to decondensation of differentiated cell chromatin. 
Recombinant PADI4 was added to permeabilized and stabilized dif- 
ferentiated C2C12 mouse myoblast nuclei (Fig. 4a). This protocol 
ensures stabilization of the nuclear component while allowing the free 
diffusion of non-chromatin-bound nuclear proteins into the extranuclear 
fraction, and their collection by washing. Incubation with active PADI4 
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Figure 3 | PADI4 citrullinates Arg 54 on linker histone H1 and affects its 
binding to nucleosomal DNA. a, Experimental strategy for screening for 
PADI4 citrullination substrates in the chromatin fraction of ES cells. b, Scatter 
plot representing the overall fold change for all identified citrullination sites. 
Red diamonds indicate PADI4-regulated citrullinations. c, Table representing 
the 40 most highly regulated PADI4 substrates, their individual citrullination 
sites and the log, SILAC ratio. Complete data set is available in Supplementary 
Table 4. d, Quantification of citrullination site Arg 54 on H1.2 through 


(Extended Data Fig. 10e, f) leads to the eviction of H1 from the chro- 
matin and its diffusion out of the permeabilized nucleus (Fig. 4b). The 
evicted H1 is citrullinated on Arg 54, as determined by mass spectro- 
metry (Extended Data Fig. 10g, h). Consistent with this, PADI4-treated 
cells showed evidence of decondensed chromatin, as determined by 
nuclear swelling, diffuse 4’ ,6-diamidino-2-phenylindole (DAPI) stain- 
ing and increased sensitivity to micrococcal nuclease (Fig. 4c, d and 
Extended Data Fig. 10i). Similar results were observed when PADI4 
was overexpressed in C2C12 cells (Fig. 4e) or NSO4G cells (data not 
shown). To monitor whether Padi4 can affect H1 binding on pluripotent 


WB: H1.2 


differential regulation of the triply charged peptide ERSGVSLAALKK. 

e, Fragmentation spectra of the triply charged and heavy SILAC labelled LysC 
peptide ERSGVSLAALKK surrounding Arg 54 of H1.2. The y and b series 
indicate fragments at amide bonds of the peptide. f, Citrullination immunoblot 
of wild-type (WT) and R54A mutant GFP-tagged H1.2 expressed and pulled 
down from ES cells expressing PADI4 or control vector (mock). Control for the 
efficiency of the pull down: GFP. g, Nucleosome pull-down assay using wild- 
type and Arg 54-mutant H1.2. WB, western blot. 


cell chromatin, we performed ChIP-qPCR (quantitative PCR) analyses 
of H1.2 on the regulatory regions of Tcl1 and Nanog and found that it is 
stabilized on Padi4 knockdown (Fig. 4f). The ability of Padi4 to disrupt 
the binding of H1 to nucleosomal DNA provides a novel mechanis- 
tic example of how citrullination regulates protein function and chro- 
matin condensation. 

The work described here identifies citrullination of chromatin com- 
ponents by Padi4 as a feature of pluripotency (Fig. 4g), in addition to its 
previously described role in the myeloid lineage. One of the reasons for 
the restricted expression pattern of Padi4 may be the requirement for 
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an open chromatin state in these cell types. The selective expression 
characteristics and the inducible nature of the catalytic activity of Padi4 
indicate that it is under tight spatial and temporal regulation, giving it a 
unique status among chromatin-modifying enzymes. As such, inap- 
propriate Padi4 activity may have deleterious consequences, which 
may explain its activation in cancers of varying origin during progres- 
sion to malignancy”. Indeed, citrullination is a common feature of several 
unrelated diseases, suggesting that strict regulation is probably a require- 
ment for the physiological function of all PADIs. During review of 
this manuscript, a study suggested that PADI2, thought to be mainly 
cytoplasmic, can also citrullinate histones and lead to transcriptional 
activation*’. This opens up the possibility that other PADIs may medi- 
ate nuclear events in specific contexts, including in pluripotent cells. 
Further research into the function and targets of PADIs is likely to 
illuminate the aetiology of several pathologies. 


METHODS SUMMARY 


Information regarding cell lines, antibodies, plasmids and chemical inhibitors used 
in this study, as well as detailed protocols for reprogramming and cell culture, 
overexpression and knockdown, gene expression analysis (qRT-PCR and micro- 
array), chromatin immunoprecipitation, embryo collection, culture, immuno- 
fluorescence staining and analysis, immunoblot analysis, SILAC labelling, mass 
spectrometry and identification of citrullinated peptides and proteins, native puri- 
fication of wild-type and mutant linker histone H1.2, HI-GFP and nucleosome 
pull-down assays, purification of recombinant PADI4-GST and in vitro deimina- 
tion assays, treatment of permeabilized cells with recombinant PADI4, and micro- 
coccal nuclease digestion are provided in the Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Ultraviolet -radiation-induced inflammation 
promotes angiotropism and metastasis in melanoma 
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Stefanie Riesenberg®, Debby van den Boorn-Konijnenberg’, Cornelia Hémig-Hdlzel’, Raphael Reuten*, Benjamin Schadow’, 
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Raymond L. Barnhill!°, Manuel Koch*, Bernd K. Fleischmann®, Irmgard Forster?, Wolfgang Kastenmiiller, Waldemar Kolanus’, 


Michael Hélzel*, Evelyn Gaffal! & Thomas Tiiting! 


Intermittent intense ultraviolet (UV) exposure represents an import- 
ant aetiological factor in the development of malignant melanoma’. 
The ability of UV radiation to cause tumour-initiating DNA mutations 
in melanocytes is now firmly established’, but how the microenvir- 
onmental effects of UV radiation** influence melanoma pathogenesis 
is not fully understood. Here we report that repetitive UV exposure 
of primary cutaneous melanomas in a genetically engineered mouse 
model’ promotes metastatic progression, independent of its tumour- 
initiating effects. UV irradiation enhanced the expansion of tumour 
cells along abluminal blood vessel surfaces and increased the number 
of lung metastases. This effect depended on the recruitment and 
activation of neutrophils, initiated by the release of high mobility 
group box 1 (HMGB1) from UV-damaged epidermal keratinocytes 
and driven by Toll-like receptor 4 (TLR4). The UV-induced neutro- 
philic inflammatory response stimulated angiogenesis and promoted 
the ability of melanoma cells to migrate towards endothelial cells 
and use selective motility cues on their surfaces. Our results not only 
reveal how UV irradiation of epidermal keratinocytes is sensed by 
the innate immune system, but also show that the resulting inflam- 
matory response catalyses reciprocal melanoma-endothelial cell inter- 
actions leading to perivascular invasion, a phenomenon originally 
described as angiotropism in human melanomas by histopatholo- 
gists®. Angiotropism represents a hitherto underappreciated mech- 
anism of metastasis’ that also increases the likelihood of intravasation 
and haematogenous dissemination. Consistent with our findings, 
ulcerated primary human melanomas with abundant neutrophils 
and reactive angiogenesis frequently show angiotropism and a high 
risk for metastases. Our work indicates that targeting the inflam- 
mation-induced phenotypic plasticity of melanoma cells and their 
association with endothelial cells represent rational strategies to 
specifically interfere with metastatic progression. 

Melanocytic nevi exposed to doses of UV light at a level causing 
sunburn show increased suprabasal location of melanocytes and immune 
cell infiltrates reminiscent of primary melanomas*. Experiments in mice 
revealed that UV-induced skin inflammatory responses can cause the 
reactive proliferation and migration of melanocytes*. We speculated 
that these microenvironmental effects of UV irradiation promote mel- 
anoma development by stimulating the survival, expansion and dissem- 
ination of DNA-damaged melanocytes. To investigate this hypothesis, 
we used genetically engineered HGF-CDK4(R24C) mice in which deregu- 
lated receptor tyrosine kinase signalling due to transgenic overexpres- 
sion of hepatocyte growth factor (HGF) and impaired cell cycle control 


due to an oncogenic CDK4(R24C) mutation in the germline cause the 
development of invasive and metastatic melanomas as seen in patients’. 

After two erythemal doses of UV light, the skin of adult HGF- 
CDK4(R24C) mice showed a prominent inflammatory infiltrate con- 
sisting predominantly of CD11b’ Ly6C*Ly6G™ neutrophils and, to a 
lesser extent, of CD11b*Ly6C*Ly6G™ inflammatory monocytes and 
CD11b*Ly6C Ly6G~ macrophages along with reactive proliferation 
of epidermal keratinocytes and peripheral blood neutrophilia (Extended 
Data Fig. la-c). Exposure of HGF-CDK4(R24C) mice to UV twice 
weekly for 6 weeks led to accumulation of melanocytes predominantly 
in the upper dermis along epidermal basement membranes (Extended 
Data Fig. 1d). To study the tumour-promoting effects of UV irradiation 
independent of its tumour-initiating effects, we first initiated melano- 
mas in the back skin of adult HGF-CDK4(R24C) mice with the car- 
cinogen 7,12-dimethylbenz(a)anthracene (DMBA) and then compared 
tumour development in UV-irradiated and control mice (Fig. 1a). 
Unexpectedly, we observed that repetitive UV exposure did not affect 
the incidence, multiplicity and growth kinetics of DMBA-induced 
primary melanomas in the skin, but rather increased the number of 
lung metastases (Fig. 1b, c). 

In 4 out of 20 UV-irradiated HGF-CDK4(R24C) mice, we observed 
macroscopically visible expansion of melanoma cells along endothelial 
cell surfaces of dermal blood vessels (Fig. 1d, e and Extended Data Fig. 2a-c). 
This angiotropic growth pattern also occurs in about one fifth of patients 
with human melanomas as an early sign of metastasis (Fig. 1f) and 
indicates a poor prognosis®. Transmission electron microscopy showed 
melanoma cells covering the abluminal side of endothelial cell surfaces 
with occasional intravasation into the vessel lumen (Extended Data 
Fig. 2d). Careful analyses of angiotropism, melanoma-infiltrating immune 
cells and metastatic spread to draining lymph nodes and lungs (Extended 
Data Fig. 2e-l) revealed a correlation between UV exposure, neutrophilic 
inflammation, perivascular invasion and spontaneous lung metastases 
(Fig. 1g). Interestingly, angiotropic growth was recapitulated in meta- 
stases of some UV-irradiated mice (Fig. 1h and Extended Data Fig. 2i, j). 

Recently, the TLR3-TRIF (TRIF, also known as TICAM1) pathway 
was shown to play a role for the recognition of UV-induced alterations 
in the double-stranded domains of some non-coding RNAs derived 
from epidermal keratinocytes’. We therefore studied the role of innate 
immune pathways for the induction of skin inflammatory responses 
following UV irradiation. We observed substantially reduced neutro- 
philic inflammation and diminished reactive keratinocyte hyperproli- 
feration after two erythemal doses of UV in Tir4 ’~ and Myd88’~ 
mice but not in Tir3 ‘~ and Trif ‘~ mice when compared to wild-type 


1Laboratory of Experimental Dermatology, Department of Dermatology and Allergy, University of Bonn, 53115 Bonn, Germany. Molecular Immunology and Cell Biology, Life and Medical Sciences Institute, 
University of Bonn, 53115 Bonn, Germany. *Unit for RNA Biology, Department of Clinical Chemistry and Clinical Pharmacology, University of Bonn, 53105 Bonn, Germany. “Institute for Dental Research 
and Oral Musculoskeletal Biology, Center for Biochemistry, Medical Faculty, University of Cologne, D-50931 Cologne, Germany. “Immunology and Environment, Life and Medical Sciences Institute, 
University of Bonn, 53115 Bonn, Germany. “Institute for Physiology |, Life & Brain Center, University of Bonn, 53105 Bonn, Germany. Department of Dermatology, University Hospital Essen, 45122 Essen, 
Germany. ®Institute of Cardiovascular Research and Sport Medicine, Department of Molecular and Cellular Sport Medicine, German Sport University Cologne, 50933 Cologne, Germany. °Division of 
Genetics and Cell Biology, San Raffaele University and Scientific Institute, 20132 Milan, Italy. ‘Department of Pathology and Laboratory Medicine, Jonsson Comprehensive Cancer Center, University of 
California Los Angeles (UCLA) Medical Center, Los Angeles, California 90095, USA. 1Institutes of Molecular Medicine and Experimental Immunology, University of Bonn, 53105 Bonn, Germany. 


6 MARCH 2014 | VOL 507 | NATURE | 109 
©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Control Analysis of 
4.5 kJ m-? UV skin and lungs 
8-10-week-old 4 Y Y v y ¥ x A 
ge 
HGF-CDK4 Ken ' d 
fale S ) 7 10 1417 98 102105 — (ays) 
b 20 10 c 
15 8 
2 10 6 
6 5 4 
1S) 
2 


0 
0 35 70 105 140 0 35 70 105 140 


10 


Number of melanomas per mouse 
Diameter of largest melanoma (mm) 


8 
6 
4 
2 


0 
0 35 70 105 140 0 35 70 105 140 
Time after tumour initiation (days) 


f 


sasej}sejew Bun] Jo JaqunN 


01201 2 
Angiotropism score 
in primary melanomas 


Figure 1 | UV irradiation of skin with DMBA-initiated primary melanomas 
in HGF-CDK4(R24C) mice selectively promotes angiotropic invasion and 
lung metastasis. a, Experimental model. b, Macroscopic appearance (left), 
mean number (middle) and growth kinetics (right) of DMBA-induced skin 
melanomas in representative cohorts of 5 UV-exposed and control mice. 

c, Macroscopic appearance of lung metastases. d, e, Macroscopically visible 
melanoma cell expansion along dermal blood vessels (arrows) in an UV- 
irradiated mouse and corresponding haematoxylin and eosin (H&E)-stained 
section and immunofluorescence analysis. f, Angiotropic growth in a human 
primary melanoma. g, Correlation of UV exposure, skin melanoma 
angiotropism and lung metastasis (n = 20; Kruskal-Wallis test with Dunn’s 
correction for multiple comparison). h, Angiotropic growth of metastatic 
melanoma in the lungs of an UV-irradiated mouse. 


controls (Fig. 2a, b). Upregulation of p21 mRNA indicating DNA 
damage responses was similar in UV-treated mice of all genotypes. 
The UV-induced induction of skin angiogenesis was also selectively 
decreased in Tlr4-’~ but not in TIr3-’~ mice compared to controls 
(Fig. 2c). Experiments with conditional gene knock-in mice expressing 
MYD88 exclusively in LysM* or K5* cells revealed that UV-induced 
skin inflammation involves MYD88-dependent TLR4-signalling pre- 
dominantly in myeloid immune cells and only to a lesser extent in 
keratinocytes (Fig. 2a, b and Extended Data Fig. 3a-e). 

HMGB1 is a ubiquitously expressed nuclear protein that partici- 
pates in DNA repair’® and also has TLR4-dependent inflammatory 
cytokine activities following its release into the extracellular space by 
severely stressed cells'’*. Upon UV irradiation, we observed cytosolic 
translocation and release of HMGB1 by epidermal keratinocytes and 
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Figure 2 | UV irradiation induces a TLR4/MYD88-driven neutrophilic 
skin inflammatory response that is initiated by HMGB1 release from 
UV-damaged keratinocytes. a, Differential effect of two UV radiation doses, 
at a level to cause sunburn, on skin immune cell infiltration and keratinocyte 
hyperproliferation in wild-type and Tir4'~ mice (left) and relative increases of 
immune cells and epidermal thickness in mice with the indicated genotypes 
(right). b, Corresponding relative increases of skin and blood neutrophils and 
skin mRNA levels for the indicated genes. c, Macroscopic and microscopic 
analyses of dermal blood vessels. d, f, Immunofluorescence analysis for 
HMGB1 in UV-irradiated and control skin or cultured keratinocytes. 

e, g, Corresponding immunoblots for HMGB1 in serum, cell lysates or culture 
medium. h, Effect of treatment with glycyrrhizin (Gly), Box-A, CLI-095 (CLI) 
or vehicle control on UV-induced keratinocyte proliferation, neutrophil 
recruitment and skin mRNA levels. All results are representative for 6 mice, 
error bars indicate the mean + s.e.m., *P < 0.05; **P < 0.01; ***P< 0.001 
(unpaired two-tailed Student’s t-test). 
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increased HMGB1 serum levels (Fig. 2d, e). HMGBI cytosolic trans- 
location and release was also found in UV-irradiated primary kerati- 
nocyte cultures (Fig. 2f, g). Treatment of mice with recombinant BoxA 
or glycyrrhizin, two functional inhibitors of extracellular HMGBI, or 
with CLI-095, a specific inhibitor of TLR4 signalling, largely abrogated 
UV-dependent neutrophilic inflammation and reactive epidermal ker- 
atinocyte hyperproliferation (Fig. 2h). Similar results were obtained 
after antibody-mediated depletion of Ly6G~ neutrophils, whereas clo- 
dronate liposome-mediated depletion of macrophages was ineffective 
(Extended Data Fig. 3f, g). These results provide a HMGB1-dependent 
mechanistic link between UV-induced DNA damage of epidermal 
keratinocytes, and a TLR4/MYD88-dependent feed-forward signalling 
loop predominantly active in neutrophil leukocytes which orchestrates 
skin inflammation. 

To evaluate whether the UV-induced, TLR4/MYD88-driven neutro- 
philic inflammatory response was also causal for increased angiotropism 
and metastasis of melanoma, we took advantage of serial HGF-CDK4 
(R24C) melanoma skin transplants that spontaneously develop lung 
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Figure 3 | Increased angiotropism and metastatic dissemination of skin 
melanomas in UV-irradiated mice depends on TLR4-driven neutrophilic 
inflammation. a, Experimental model. b, Growth kinetics of serial HCmel12 
melanoma skin transplants in UV-exposed and control mice (n = 5, left), 
macroscopic appearance of lung metastases (right). c, Angiotropic growth of 
EGFP-expressing HCmel12 cells in UV-irradiated skin. H&E stained section 
(left) and 3D reconstructed whole mount confocal immunofluorescence image 
(right). d, H&E-stained section of a spontaneous HCmel12 lung metastasis 
(left) and effect of UV exposure on skin melanoma angiotropism and lung 
metastasis (n = 15, right). e, Effect of UV irradiation on angiotropism and lung 
metastasis in TLR4-deficient mice (n = 12). f, g, Effect of antibody-mediated 
Ly6G™ neutrophil depletion and TLR4-inhibition on UV-induced 
angiotropism and lung metastases (n = 12). Cumulative results of 3 
independent experiments are shown in d-g (Kruskal-Wallis test with Dunn’s 
correction for multiple comparison). 
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metastases. Repetitive UV exposure of the back skin comprising the 
tumour transplantation site of HCmel12 melanomas did not affect local 
growth kinetics but increased angiotropic invasion and lung metastases 
when compared to non-irradiated controls (Fig. 3a-d and Extended 
Data Fig. 4a—e). Similar results were obtained with a second, indepen- 
dently generated serial skin transplant HCmel31. The expansion of 
enhanced green fluorescent protein (EGFP)-expressing HCmel12 cells 
in the inflamed skin of UV-irradiated mice could be visualized by 
whole-mount confocal immunofluorescence microscopy two weeks 
after transplantation (Fig. 3c). This serial transplant model allowed 
us to study melanoma cell behaviour following UV irradiation in mice 
with genetic deficiencies of the TLR4/MYD838 signalling axis. 

We found that the UV-dependent increase of skin melanoma angio- 
tropism and spontaneous lung metastasis was completely abrogated 
in Tira ‘~ and Myd88 ‘~ mice (Fig. 3e and Extended Data Fig. 4f, g). 
Investigations with conditional Myd88 knock-in mice revealed that 
MYD88 signalling was predominantly required in myeloid immune 
cells (Extended Data Fig. 4h, i). Antibody-mediated depletion experi- 
ments confirmed a critical role for Ly6G* neutrophils (Fig. 3f and 
Extended Data Fig. 4j, k). Importantly, the UV-dependent increase 
of angiotropism and lung metastases was also abolished by pharma- 
cologic inhibition of extracellular HMGB1 or TLR4 signalling (Fig. 3g 
and Extended Data Fig. 41, m). These observations in an independent 
experimental setting recapitulated key features of our findings with 
DMBA-induced primary melanomas and unambiguously demonstrated 
that UV-induced, TLR4/MYD88-driven neutrophilic inflammation 
promotes angiotropism and metastasis in melanoma. 

Next, we studied melanoma-endothelial cell interactions under inflam- 
matory conditions in vitro. In transwell assays, we observed that pre- 
incubation of HCmel12 cells with neutrophil-conditioned medium 
enhanced their migration towards endothelial cells and, to a lesser 
extent, towards keratinocytes. This was mimicked by TNF, a key inflam- 
matory cytokine present in UV-induced skin that is also secreted by 
activated neutrophils (Fig. 4a and Extended Data Fig. 5a, b). Inflam- 
matory activation of HCmel12 cells with TNF also increased their 
adhesion to vascular extracellular matrix components (Fig. 4b). Time- 
lapse video microscopy revealed that HCmel12 cells selectively migrated 
with increased velocities and for longer distances on endothelial cell 
surfaces when compared to surfaces of keratinocytes or purified extra- 
cellular matrix components alone, an effect that was significantly enhanced 
by TNF (Fig. 4c, Extended Data Fig. 5c and Supplementary Videos 1 
and 2). TNF present in neutrophil-conditioned medium also enhanced 
endothelial cell sprouting from aortic ring explants and promoted 
the pericyte-like expansion of co-cultured melanoma cells along such 
endothelial outgrowths (Fig. 4d and Extended Data Fig. 5d-g). Finally, 
we could demonstrate that melanoma cells preferentially migrated 
towards, and tightly associated with, blood vessel endothelial cells in 
inflamed ear tissue explants from UV-irradiated mice (Fig. 4e, Extended 
Data Fig. 5h-] and Supplementary Videos 3 and 4). 

A panel of human melanoma cell lines carrying characteristic BRAF 
or RAS mutations also showed enhanced migration towards human 
vascular endothelial cells in transwell assays and adhered more firmly 
to vascular extracellular matrix components following inflammatory 
activation with TNF (Fig. 4f, g and Extended Data Fig. 6a—c). Human 
melanoma cells also selectively migrated more efficiently on endothe- 
lial cell surfaces when compared to surfaces of keratinocytes or extra- 
cellular matrix components when activated with TNF (Fig. 4h and 
Extended Data Fig. 6d, e). Human melanoma cells also invaded inflamed 
mouse ear tissue explants and closely associated with blood vessel 
endothelial cells (Fig. 4i and Extended Data Fig. 6f). Collectively, our 
observations in these ex vivo and in vitro assays provided evidence that 
a neutrophilic inflammatory response promoted angiotropic properties 
of melanoma cells, reflected by the selective migration towards and on 
endothelial cell surfaces, both in the HGF-CDK4(R24C) mouse model 
and in human melanoma cells bearing common genetic alterations. 
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Bioinformatic analyses of public data sets revealed that inflammatory 
activation of human melanoma cells with TNF induces genes enriched 
for the GO terms ‘cell migration’, ‘cell adhesion’ and ‘angiogenesis’ 
(Extended Data Fig. 7 and Supplementary Tables 1 and 2). A subset 
of melanoma cell lines exhibits a partial baseline activation of this TNF- 
responsive signature, which includes genes implicated in vasculogenic 
mimicry’* and genes induced upon co-culture with endothelial cells’*. 
We could confirm these in global transcriptional responses to TNF 
which strongly correlated in a set of well-characterized BRAF and RAS 
mutant melanoma cell lines” (Extended Data Fig. 8a, b and Supplemen- 
tary Table 3). A core set of TNF-regulated genes known to be involved 
in tumour-endothelial interactions and angiogenesis was further vali- 
dated by quantitative PCR with reverse transcription (qRT-PCR) in 
human and mouse melanoma cells (Extended Data Fig. 8c). Together 
with our experimental observations, these data support the emerging 
concept that environmental inflammatory signals can increase tumour 
cell plasticity and promote embryonic cell phenotypes’*"*. Indeed, neural 
crest progenitors of melanocytes migrate along blood vessel surfaces 
into the skin during embryogenesis'”~””. 


112 | NATURE | VOL 507 | 6 MARCH 2014 


Figure 4| An inflammatory environment promotes the ability of mouse 
and human melanoma cells to migrate towards endothelial cells and use 
selective motility cues on their surfaces. a, Migration of HCmel12 melanoma 
cells towards bEND.5 endothelial cells or SP1 keratinocytes in transwell 
assays, influence of neutrophil-derived inflammatory mediators (Neu) and 
role of TNF. Shown are mean number of migrated cells in triplicate 
determinations + s.e.m. b, Static adhesion of TNF-activated and control 
HCmel12 cells to the indicated extracellular matrix components. Shown are 
mean of triplicate determinations + s.e.m. c, Two-dimensional (2D) migration 
tracks of individual control and TNF-activated EGFP-expressing HCmel12 
cells on a surface of confluent bEND.5 endothelial cells imaged by time lapse 
microscopy over 12h (upper panel), average accumulated distances and 
migration velocities of HCmel12 cells on BEND.5 endothelial cell or SP1 
keratinocyte surfaces (lower panel, n = 70 tracks of individual cells, bars 
indicate the mean). d, e, Confocal immunofluorescence images showing 
EGFP-expressing HCmel12 melanoma cells in a pericyte-like association 
with activated CD31* endothelial cells sprouting from inflamed aortic rings 
or present in ear tissue explants from UV-irradiated mice. f-i, Similar 
experimental results in transwell migration, static adhesion, 2D cell surface 
migration (on HUVEC vs HaCaT keratinocytes) and ear tissue invasion 
assays with the indicated human melanoma cell lines. Data in a and f are 
representative for 1 out of 5, data in b, c and g, h for 1 out of 3 independently 
performed experiments; statistical significance was determined using the 
unpaired two-tailed Student’s t-test for a, b, f, g and the unpaired two-tailed 
Mann-Whitney U-test for ¢, h; *P < 0.05; **P< 0.01. 


Overall, we found that neutrophils promote tumour progression, con- 
sistent with findings showing that the presence of neutrophils within 
tumours and their increase in peripheral blood indicate a poor prognosis 
in melanoma patients”®”’. In an unselected sentinel node-staged patient 
cohort treated at our institution we verified that superficial ulceration 
and prominent neutrophil infiltration of primary melanomas is assoc- 
iated with signs of angiotropism and lymph node metastasis (Extended 
Data Fig. 9), confirming published reports of select cases®. Our experi- 
mental results indicate that neutrophilic inflammation in ulcerated 
melanomas shifts melanoma cells towards a migratory phenotype, acti- 
vates endothelial cells and fosters reciprocal melanoma-endothelial 
interactions, which together promotes angiotropic invasion and meta- 
stasis. This pathogenetic connection also explains why adjuvant admin- 
istration of type I interferons, which inhibit neutrophil function and 
angiogenesis”, is particularly effective in patients with ulcerated prim- 
ary melanomas”. 

We show that UV irradiation not only causes tumour-initiating 
genomic alterations in melanocytes but also promotes their perivas- 
cular expansion and metastatic dissemination through TLR4/MYD88- 
driven neutrophilic inflammation initiated by HMGB1 release from 
UV-damaged keratinocytes (Extended Data Fig. 10). In addition to 
TNF, it is probable that other cytokines also enhance angiotropism of 
melanoma cells, however, dissecting their individual contribution will 
require further investigations with the respective knockout mice. Our 
work is in accordance with increasing evidence that HMGB1 and other 
endogenous TLR ligands critically contribute to tumour-promoting 
inflammatory networks*”, and that neutrophils are important players 
in this process****”’, Neutrophils may be particularly important in mel- 
anomas which frequently secrete large amounts of CXCR2 ligands”. 
In addition to their role in primary melanomas shown in this work, 
neutrophils can also promote tumour-endothelial interactions in the 
lungs to facilitate metastatic seeding through diverse and complex 
mechanisms” ~”’. This includes the recently discovered role of neutro- 
phil extracellular DNA traps, which also activate the coagulation sys- 
tem and probably represent a powerful innate immune stimulus”. 

Using immunocompetent HGF-CDK4(R24C) mice in combination 
with ex vivo and in vitro assays, here we present pathophysiologically 
relevant model systems that open up new avenues to further study 
the complex molecular and cellular mechanisms governing reciprocal 
melanoma-endothelial cell interactions at the primary tumour site. 
These experimental tools also provide opportunities to delineate novel 
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treatment strategies that impair metastatic tumour progression, includ- 
ing targeted inhibition of selective motility cues or of neutrophilic 
inflammation driven by HMGB1 and the TLR4/MYD88 signalling 


axis*”. 

METHODS SUMMARY 

Animal experiments and UV irradiation. Melanoma-prone HGF-CDK4(R24C) 
mice in the C57BL/6 background were bred as described previously'®. HGF- 
CDK4(R24C) mice with carcinogen-initiated primary melanomas or C57BL/6 
wild-type as well as Tid’, Myd88‘~ and Myd88™**" mice bearing serial mel- 
anoma skin transplants were exposed to 4.5 kJ m~* UV twice weekly and tumour 
development was monitored. In some experiments myeloid immune cells were 
depleted or the HMGB1/TLR4 signalling axis was blocked pharmacologically. All 
animal experiments were approved by the local government authorities (LANUV, 
NRW, Germany) and were performed according to the institutional and national 
guidelines for the care and use of laboratory animals. 

Histology, immunostaining, electron microscopy, flow cytometry and RT- 
PCR. Angiotropism of melanomas, skin inflammatory responses and metastatic 
dissemination were analysed by H&E staining. Immunostaining was performed 
with antibodies for melanoma cells, activated keratinocytes, endothelial cells and 
immune cells. Ultrastructural analyses were performed with transmission electron 
microscopy. The composition of immune cells in tissues and blood was deter- 
mined by flow cytometry. Gene expression was quantified by real-time RT-PCR 
analyses. Primers are given in Supplementary Table 4. 

Organ and cell culture experiments. Primary keratinocytes were cultured from 
newborn skin, irradiated with UV and the release of HMGB1 determined by immu- 
nofluorescence and immunoblot analyses. The effect of pro-inflammatory med- 
iators on the migratory behaviour of melanoma cells towards and on endothelial 
cell surfaces was investigated in transwell and novel two-dimensional cell surface 
migration assays in vitro. Melanoma-endothelial cell interactions were further 
studied in co-cultures of EGFP-expressing tumour cells with red fluorescent aortic 
ring explants. An ear tissue invasion assay was established in which fluorescent 
melanoma cells were seeded onto ear tissue explants from UV-irradiated or control 
mice and allowed to migrate into the dermis. Ear tissue whole mounts were immu- 
nostained for endothelial cells and imaged with confocal laser-scanning immuno- 
fluorescence microscopy. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Sulphoglycolysis in Escherichia coli K-12 closes a gap 
in the biogeochemical sulphur cycle 


Karin Denger', Michael Weiss”, Ann-Katrin Felux*, Alexander Schneider’, Christoph Mayer’, Dieter Spiteller', Thomas Huhn’*, 


Alasdair M. Cook! & David Schleheck* 


Sulphoquinovose (SQ, 6-deoxy-6-sulphoglucose) has been known 
for 50 years as the polar headgroup of the plant sulpholipid’” in the 
photosynthetic membranes of all higher plants, mosses, ferns, algae 
and most photosynthetic bacteria’. It is also found in some non- 
photosynthetic bacteria‘, and SQ is part of the surface layer of some 
Archaea’. The estimated annual production of SQ‘ is 10,000,000,000 
tonnes (10 petagrams), thus it comprises a major portion of the organo- 
sulphur in nature, where SQ is degraded by bacteria®’. However, 
despite evidence for at least three different degradative pathways 
in bacteria®*, no enzymic reaction or gene in any pathway has been 
defined, although a sulphoglycolytic pathway has been proposed’. 
Here we show that Escherichia coli K-12, the most widely studied 
prokaryotic model organism, performs sulphoglycolysis, in addi- 
tion to standard glycolysis. SQ is catabolised through four newly 
discovered reactions that we established using purified, heterolo- 
gously expressed enzymes: SQ isomerase, 6-deoxy-6-sulphofructose 
(SF) kinase, 6-deoxy-6-sulphofructose-1-phosphate (SFP) aldolase, 
and 3-sulpholactaldehyde (SLA) reductase. The enzymes are encoded 
in a ten-gene cluster, which probably also encodes regulation, trans- 
port and degradation of the whole sulpholipid; the gene cluster is 
present in almost all (>91%) available E. coli genomes, and is wide- 
spread in Enterobacteriaceae. The pathway yields dihydroxyacetone 
phosphate (DHAP), which powers energy conservation and growth 
of E. coli, and the sulphonate product 2,3-dihydroxypropane-1- 
sulphonate (DHPS), which is excreted. DHPS is mineralized by other 
bacteria, thus closing the sulphur cycle within a bacterial community. 
Recent work showed that environmental isolates of Klebsiella spp. 
(Enterobacteriaceae) convert SQ quantitatively to DHPS”®*, and we pro- 
posed that utilization of SQ might be a property of Enterobacteriaceae. 
We found that four genome-sequenced E. coli K-12 substrains (BW25113, 
DH1, MG1655 and W3100), after subculturing, grew with SQ within 
1 to 3 days. We chose to work (largely) with the fastest-growing sub- 
strain, MG1655. The organism used SQ as a sole source of carbon and 
energy with a molar-growth yield of 3 g of protein per mol of SQ carbon, 
whereas glucose gave approximately 6 g of protein per mol of carbon; 
the latter value represented mass balance of glucose carbon as biomass 
and CO) (ref. 9). However, approximately 1 mol of DHPS per mol of 
SQ was released into the growth medium (Fig. 1a), as observed with 
Klebsiella oxytoca®. Thus, there was complete mass balance for carbon 
and for sulphur from SQ. The growth rate with SQ was 0.13 h ‘05h + 
with glucose), and the specific degradation rate for SQ in vivo was 
120 mU per mg of protein (1 mU = 1 nmol min” '). We concluded 
that SQ is metabolized to a C3 sulphonate, which is excreted as DHPS, 
and that the remainder of the molecule is used for growth (Fig. 2a). 
The out-grown culture was filter-sterilized and inoculated with 
Cupriavidus pinatubonensis JMP 134, which can utilize DHPS for growth”, 
but cannot utilize SQ®. C. pinatubonensis grew with the DHPS formed 
from SQ by E. coli, and released its sulphonate sulphur quantitatively 
as sulphate (Fig. 1b) using a pathway described elsewhere’®. We thus 
demonstrated mineralization of SQ in a laboratory model system. 


Proteins from whole cells of E. coli K-12 grown with glucose or 
SQ were subjected to two-dimensional polyacrylamide gel electropho- 
resis (2D-PAGE) (Extended Data Fig. 1) and examined by peptide 
fingerprinting—-mass spectrometry (PF-MS) (Extended Data Table 1). 
The immediately relevant, apparently SQ-inducible proteins (see Extended 
Data Fig. 1 and Extended Data Table 1) were attributed to b3878 (also 
known as yihQ (b numbers are locus tags); predicted to be an o- 
glucosidase), b3879 (also known as yihR; predicted to be an epimerase), 
b3880 (yihS; predicted to be an isomerase), b3881 (yihT; predicted to 
be an aldolase) and b3882 (yihU; predicted to be an NAD*/NADH- 
linked dehydrogenase/reductase). Transcriptional analyses for the gene 
cluster b3879-b3882, as well as for b3883 (also known as yih V; predicted 
to bea sugar kinase), confirmed a strong inducible transcription during 
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Figure 1 | Complete degradation of sulphoquinovose during growth. 

a, Growth of E. coli K-12 substrain MG1655 with SQ and excretion of 
2,3-dihydroxypropane-1-sulphonate (DHPS). b, Growth of C. pinatubonensis 
JMP134 with the DHPS formed from SQ by E. coli. Data from representative 
growth experiments (n = 3) are shown. To allow a compact graph, sulphate 
release and not total sulphate is shown. 
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Figure 2 | The four core enzyme reactions of sulphoglycolysis, with 
transport, and the corresponding genes in a ten-gene cluster in E. coli 
K-12. a, SQ is metabolized by four enzymes (shown in colour) to a C3 
sulphonate, DHPS, which is excreted, and the remainder of the molecule is 
used for growth. For comparison, the analogous enzyme reactions for the 
catabolism of (unsubstituted) glucose through the glycolytic pathway in 

E. coli are also shown (dashed arrows). Fba, fructose bisphosphate 
aldolase; GAP, glyceraldehyde-3-phosphate; Pfk, phosphofructokinase; 
Pgi, phosphoglucose isomerase; PTS, phosphotransferase system permease; 


growth with SQ, but not during growth with glucose (Extended Data 
Fig. 2). Furthermore, single-gene knockouts (in substrain BW25113 
(ref. 11)) in genes b3876 (also known as yihO; predicted to be a major 
facilitator superfamily (MFS)-type transporter), b3880, b3881 and b3883 
did not grow with SQ, which confirmed and expanded on the proteo- 
mic and transcriptional data (Fig. 2b). 

We thus identified a gene cluster in E. coli K-12 that contained SQ- 
inducible, essential genes for catabolism of SQ, but we still did not know 
which pathway was involved. A sulphoglycolytic pathway would involve 
a hypothetical 3-sulpholactaldehyde (SLA) reductase to yield DHPS 
in the final reaction (apart from export) (Figs 1a and 2a), whereas a 
hypothetical SQ dehydrogenase as the first reaction would lead into 
hypothetical Entner-Doudoroff-type or pentose-phosphate-type path- 
ways, or another novel pathway. An SLA reductase was detected (assayed 
as DHPS oxidation) in cell-free extracts of SQ-grown substrain MG1655 
at a specific activity of 420 mU per mg of protein, which exceeds the 
specific degradation rate for SQ in vivo and, thus, was sufficient to 
explain growth. This enzyme activity was not detected in extracts of 
glucose-grown cells. Thus the enzyme was confirmed to be inducible, 
and it was specific for NAD*; NADP* was not a substrate. Further- 
more, SQ did not lead to reduction of NAD* or of NADP* in the 
extracts of SQ- or glucose-grown cells, hence, hypothetical SQ dehy- 
drogenase was not detectable. These data led us to predict the sulpho- 
glycolytic pathway depicted in Fig. 2a, including the requirement for 
sulphonate import and export across the cell membrane’*"*. 

The four predicted core enzymes of the pathway (Fig. 2a) were heter- 
ologously expressed and purified as His-tagged proteins, b3880 (putative 
isomerase), b3883 (putative sugar kinase), b3881 (putative aldolase) 
and b3882 (putative reductase) (Extended Data Fig. 3). Protein b3882 
was shown to encode SLA reductase. First, we partially purified and 
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Tpi, triose phosphate isomerase. b, The EcoGene E. coli website (http:// 
www.EcoGene.org) uses the abbreviation yih for most of these genes; we 
have retained this nomenclature. Vertical stripes, genes confirmed as being 
essential for growth with SQ by mutational analysis; horizontal stripes, genes 
confirmed as being inducible for growth with SQ by proteomic and/or 
transcriptional analyses; box-framed genes, genes encoding the four core 
enzymes of the pathway (shown in a) that were subject of heterologous 
expression, purification and functional characterization. 


identified (PF-MS) the wild-type enzyme in cell-free extracts of sub- 
strain MG1655 (see above), and second, we examined the recombinant 
protein (see below). In both cases we identified that b3882 represents an 
SLA reductase; the enzyme showed no activity with 4-hydroxybutyrate”’. 

The heterologously expressed and purified putative isomerase (b3880) 
caused about one-fourth of the SQ in the reaction mixture to dis- 
appear, as observed by high-pressure liquid chromatography—mass spec- 
trometry (HPLC-MS), and a new peak was formed that eluted with 
shorter retention time, but exhibited the same relative mass (M, = 244 
Dalton (Da); observed as a quasi-molecular ion in the negative ion 
mode ([M-H] ) ata mass-to-charge ratio (m/z) of 243) (Fig. 3a, b). The 
new peak was confirmed to represent 6-deoxy-6-sulphofructose (SF), as 
proposed elsewhere’, by the HPLC separation pattern (Extended Data 
Fig. 4), by the matching exact mass of the [M-H] ion (Extended Data 
Fig. 4), and by its MS—MS fragmentation pattern (Extended Data Fig. 5). 
Thus we confirmed that b3880 catalysed the SQ isomerase reaction. 

The reaction mixture was augmented with ATP and the putative 
sugar kinase (b3883). The peaks of SQ and SF partially disappeared 
and a new peak was formed (Fig. 3c). This new peak was confirmed to 
represent 6-deoxy-6-sulphofructose-1-phosphate (SFP), as proposed 
elsewhere’, by the matching exact mass of the [M-H] ion (observed 
mass, 322.9877 Da; theoretical mass of CgH,20,,;PS , 322.9843 Da) 
and by its fragmentation pattern (Extended Data Fig. 6). HPLC con- 
firmed that ATP disappeared and ADP was formed during the reac- 
tion and, furthermore, that SFP was converted back to SF when alkaline 
phosphatase was added to a preparation of SFP (not shown). Thus, 
with b3883, we expressed an ATP-dependent kinase that phosphory- 
lated SF to SFP. 

The reaction mixture was augmented with the putative aldolase (b3881). 
The peak for SFP partially disappeared, and two new peaks were formed 
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Figure 3 | Illustration of the reactions of the four core enzymes of NADH and reductase (b3882) (f = 240 min). f, Sample after extended 


sulphoglycolysis in vitro. The transformation of SQ to SF, SFP, DHAP and 
SLA, and DHPS, by successive addition of recombinantly expressed pathway 
enzymes was followed by HPLC-ESI-MS. a, Sample of SQ in reaction buffer 
(t = 0 min). b, Sample after addition of isomerase (b3880) (t = 60 min). 

c, Sample after addition of ATP and kinase (b3883) (t = 120 min). d, Sample 
after addition of aldolase (b3881) (ft = 180 min). e, Sample after addition of 


(Fig. 3d). The first new peak was identified to represent DHAP, as pro- 
posed elsewhere’, with an authentic DHAP standard. The second new 
peak was confirmed to represent SLA, as proposed elsewhere’, by the 
matching mass of the [M-H] ion (M, = 154 Da; observed as [M-H] ~ 
ion at m/z = 153) and by its fragmentation pattern (Extended Data 
Fig. 7); the same peak was observed when we used recombinant SLA 
reductase in reverse to oxidize DHPS to SLA (see above). Thus, with 
b3881, we expressed an aldolase that cleaved SFP into DHAP and SLA. 
The SFP turnover was incomplete (Fig. 3d); the equilibrium of the 
corresponding enzyme reaction in glycolysis (fructose-1,6-bisphosphate 
aldolase) lies far to the left’®, that is, hardly any products are formed. 

However, when NADH and the recombinant SLA reductase (b3882) 
were added, the peak for SFP was further diminished, as was the peak 
for SQ, and that for the aldolase-reaction product DHAP was further 
increased (Fig. 3e). In addition, the peak for SLA had disappeared, and 
the peak for the anticipated sulphonate product, DHPS, was formed 
(Fig. 3e and Extended Data Fig. 7). After an extended incubation of 
the four-enzyme reaction (see Fig. 3e, f), the peaks for SQ and SFP had 
almost completely disappeared, and the peaks for DHAP and DHPS 
had further increased. 

Together, the results show that the SQ pathway in E. coli K-12 (Fig. 2a) 
does not involve a desulphonation reaction and that no substrate-level 
phosphorylation of the sulphonated C; intermediate occurs, which has 
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incubation of the four-enzyme reaction (ft = 360 min). The total-ion 
chromatograms (TICs) recorded in the negative-ion mode from the MS-MS 
fragmentation of the quasi-molecular ions [M-H] of SQ and SF, SFP, DHAP, 
SLA and DHPS, from a representative experiment (n = 5) are shown. For 
representative MS-MS fragmentation patterns of the [M-H] ions of SQ and 
SF, SFP, and SLA and DHPS, see Extended Data Figs 5, 6 and 7, respectively. 


been used previously’ as a default hypothesis. Furthermore, we deduce 
that there are ten genes in the gene cluster (Fig. 2b). The core pathway 
comprises a SQ transporter (for example, b3876, YihO), SQ isomerase 
(b3880, YihS), SF kinase (b3883, YihV), SEP aldolase (b3881, YihT), SLA 
reductase (b3882, YihU) and a DHPS exporter (for example, b3877, 
YihP), which could be under the putative control of repressor b3884 
(YihW). We propose a sulpholipid porin (b3875, OmpL), a sulpholi- 
pid «-glucosidase (b3878, YihQ), and an epimerase (b3879, YihR) to 
funnel other SQ derivatives into the pathway; for example, the whole 
sulpholipid (see Extended Data Fig. 8). 

The gene cluster is found in at least 1,009 (>91%) of the 1,110 genome 
sequences of commensal E. coli, as well as pathogenic E. coli (for example, 
EHEC) strains, that were available in November 2013 (finished and 
draft genome sequences) in the Integrated Microbial Genomes (IMG) 
and Human Microbiome Project (HMP) databases (that is, gene clus- 
ters with syntenic yihTUVW and collinear homologues of yihSRQPO 
and ompL in variable order). Hence, the gene cluster is a feature of the 
core-genome of E. coli species. It can also be found in a wide range of 
other Enterobacteriaceae (for example, Chronobacter sakazakii ATCC 
BAA-894, Klebsiella oxytoca 10-5242, Pantoea ananatis LMG 20103 
and Salmonella enterica LT2). We therefore suspect that the pathway 
has a significant role in bacteria in the alimentary tract of all omnivores 
and herbivores, that the pathway occurs in bacteria in excrement from 
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these animals, and that it occurs in plant pathogens, which would 
explain part of the widespread occurrence of microbial degradation 
observed**. 

SQ is produced in large amounts in nature and thus represents a 
significant proportion of the organic sulphur cycle’, and it is degraded 
in similar amounts by both bacteria®* and algae’, or it would accu- 
mulate in the environment. We see here that the Enterobacteriaceae 
use one pathway (Fig. 2a) to initiate degradation of SQ, and that a com- 
munity is required for complete degradation (Fig. 1b)*. This covers a 
variety of habitats, but we know that other pathways exist. A previous 
paper’ presented evidence for SQ dehydrogenase, which we also observe 
in our SQ-using strain of Pseudomonas putida (A.-K.F. unpublished 
observations). Notably, another group® reported complete SQ degra- 
dation, including desulphonation, in a single organism; however, this 
organism has been lost’®. 

In summary, we have established that sulphoglycolysis, which was 
named but not defined in a previous report”’, converts SQ to DHPS in 
the most widely studied prokaryotic model organism, E. coli K-12, rep- 
resenting many Enterobacteriaceae (Fig. 2a). We have identified a gene 
cluster in E. coli K-12 (Fig. 2b) that encodes the pathway. The core 
pathway, for SQ, involves four newly discovered enzymes, two newly 
identified transporters and three newly characterized intermediates 
(Fig. 2a, b). We know that the pathway is regulated (Extended Data 
Figs 1 and 2) and we suspect that it includes the degradation of the 
whole sulpholipid (Extended Data Fig. 8). The pathway represents a 
substantial part of the biogeochemical sulphur cycle, and the pathway 
is likely to have a significant role in bacteria in the alimentary tract of 
all omnivores and herbivores, and in plant pathogens. We and others® * 
anticipate other degradative pathways for SQ in nature; for example, in 
bacteria of all marine, freshwater and terrestrial habitats where SQ is 
produced and degraded. We now provide the tools to elucidate these 
degradative pathways. 


METHODS SUMMARY 


SQ and DHPS were synthesized chemically and identified by NMR and mass 
spectrometry*"”. Cultivation, preparation of cell-free extracts, enzyme purification, 2D- 
PAGE and PF-MS, RNA preparation and polymerase chain reaction with reverse 
transcription (RT-PCR), and expression and purification of His-tagged proteins, 
are described in the online Methods. SQ, SF, SFP, SLA, DHAP and DHPS were 
separated using hydrophilic interaction liquid chromatography (HILIC)* and 
detected by an evaporative light scattering detector (ELSD)* or electrospray ion- 
ization (ESI)-time-of-flight (TOF)-MS or ESI-iontrap-MS (see Methods). The 
enzyme reaction mixture (Fig. 3) was 3 mM SQ in 50 mM ammonium acetate buffer 
(pH 8.7), and 8 mM ATP, 0.5 mM MgCl, and 8 mM NADH supplemented with 
the corresponding enzymes (each 50 tg ml” '). 


Online Content Any additional Methods, Extended Data display items and Source 


Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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The recognition events that mediate adaptive cellular immunity and 
regulate antibody responses depend on intercellular contacts between 
T cells and antigen-presenting cells (APCs)'. T-cell signalling is 
initiated at these contacts when surface-expressed T-cell receptors 
(TCRs) recognize peptide fragments (antigens) of pathogens bound 
to major histocompatibility complex molecules (pPMHC) on APCs. 
This, along with engagement of adhesion receptors, leads to the 
formation of a specialized junction between T cells and APCs, known 
as the immunological synapse’, which mediates efficient delivery of 
effector molecules and intercellular signals across the synaptic cleft’. 
T-cell recognition of pMHC and the adhesion ligand intercellular 
adhesion molecule-1 (ICAM-1) on supported planar bilayers reca- 
pitulates the domain organization of the immunological synapse*”’, 
which is characterized by central accumulation of TCRs’, adjacent to 
a secretory domain’, both surrounded by an adhesive ring*”. Although 
accumulation of TCRs at the immunological synapse centre corre- 
lates with T-cell function’, this domain is itself largely devoid of 
TCR signalling activity’*®, and is characterized by an unexplained 
immobilization of TCR-pMHC complexes relative to the highly 
dynamic immunological synapse periphery**. Here we show that 
centrally accumulated TCRs are located on the surface of extracel- 
lular microvesicles that bud at the immunological synapse centre. 
Tumour susceptibility gene 101 (TSG101)° sorts TCRs for inclusion 
in microvesicles, whereas vacuolar protein sorting 4 (VPS4)”* medi- 
ates scission of microvesicles from the T-cell plasma membrane. The 
human immunodeficiency virus polyprotein Gag co-opts this pro- 
cess for budding of virus-like particles. B cells bearing cognate pMHC 
receive TCRs from T cells and initiate intracellular signals in response 
to isolated synaptic microvesicles. We conclude that the immuno- 
logical synapse orchestrates TCR sorting and release in extracellular 
microvesicles. These microvesicles deliver transcellular signals across 
antigen-dependent synapses by engaging cognate pMHC on APCs. 

The nature of the biophysical environment that governs molecular 
domain organization at the immunological synapse remains unclear. 
Confinement of pMHC*, TCRs’ and cytoplasm (Supplementary Fig. 1) 
suggests that a general diffusion barrier separates TCRs and cytoplasm 
at the immunological synapse centre from the rest of the T cell. To better 
understand the basis for the observed central confinement of pMHC, 
TCRs and cytoplasm at the immunological synapse, we investigated 
CD4" T-cell immunological synapse formation using high-resolution 
optical imaging by total internal reflection fluorescence microscopy 
(TIRFM), integrated with transmission electron microscopy (TEM) 
and electron tomography. 

T cells from TcrAND transgenic mice’ form TCR microclusters in 
response to engagement by the cognate class II pMHC molecule I-E* 


complexed with the moth cytochrome C peptide MCC 88-103 (MCC- 
LE), Over approximately 10 min, TCR microclusters, together with 
bound pMHC*®”, are transported on the cell surface to the immuno- 
logical synapse centre, where they are consolidated into an immobilized 
domain®. To follow ultrastructural changes associated with immuno- 
logical synapse formation, TcrAND T cells were fixed after 5, 10, 15 and 
20 min of interaction with supported lipid bilayers containing MCC- 
I-E“ and ICAM-1, and imaged first by TIRFM and then by TEM. As 
a control, we used the non-cognate pMHC B2m-I-EX, which did not 
arrest motility or induce immunological synapse formation in TcrAND T 
cells (Supplementary Fig. 2a). TEM time series of TcrAND T cells form- 
ing immunological synapses on antigen-containing bilayers revealed 
changes in cell morphology that were characteristic of antigen-induced 
cell polarization (Supplementary Fig. 2b-d). Notably, at the 10 min time 
point, the centre of the T-cell contact interface showed an unexpected 
change in morphology, from a planar plasma membrane in continuous 
close apposition with the planar bilayer (Fig. 1a) to the appearance of 
numerous microvesicles (Fig. 1b and Supplementary Fig. 2e), approxi- 
mately 70 nm in diameter (Supplementary Fig. 3a), that were contained 
within a central extracellular cavity (Fig. 1b). Microvesicle formation 
was antigen-specific, as they did not form with bilayers containing Bym- 
I-E* (Fig. la and Supplementary Fig. 2e), and could be modulated by 
the potency of the activating ligand, or by provision of costimulation” 
(Fig. 1f and Supplementary Discussion). To visualize the distribution 
of microvesicles more clearly and verify their dissociation from the 
plasma membrane, we performed dual-axis tomography (Supplemen- 
tary Video 1) on four serial sections through an immunological synapse, 
ranging from 150-250 nm in thickness. The associated three-dimensional 
model (Fig. 1c-e and Supplementary Videos 2 and 3) of the joined tomo- 
grams demonstrated that discrete extracellular microvesicles, with no 
connection to overlying plasma membrane (Supplementary Fig. 4a, b), 
predominate in the central cavity, along with occasional membrane 
projections and membrane buds of nascent microvesicles (Supplemen- 
tary Fig. 4c-i). 

Comparison of the distributions of TCRs and microvesicles at the 
immunological synapse demonstrated that they were spatially corre- 
lated (Supplementary Fig. 3b-d). To establish whether TCRs present at 
the immunological synapse centre were associated with microvesicles, 
we developed a novel method for optical-electron microscopy correla- 
tion, based on registration of TIRFM and corresponding TEM images 
of immunological synapses aligned using a microfabricated grid (Sup- 
plementary Fig. 5). Electron tomography of T-cell-bilayer interfaces 
confirmed the presence of microvesicles within a roughly circular extra- 
cellular cavity at the immunological synapse centre (Fig. 1h, iand Sup- 
plementary Videos 4 and 5). Optical-electron microscopy correlation 
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Figure 1 | Antigen-induced release of TCR-enriched microvesicles at the 
centre of immunological synapse. a, b, Transmission electron micrographs of 
TcrAND T cells interacting with supported lipid bilayers (SLB) containing 
ICAM-1, and the indicated pMHC. Inset, low magnification images of the same 
cells. Scale bar, 500 nm; inset scale bar, 2 um; red arrowheads, extracellular 
microvesicles; white arrowheads, plasma membrane. c, A three-dimensional 
ultrastructural model of the immunological synapse centre. The model was 
constructed from dual-axis tomograms of four serial sections made orthogonal 
to the bilayer from a single immunological synapse (numbered in e). ILV, 
intraluminal vesicle; MVB, multivesicular body; scale bar, 250 nm. d, Model 
from c with intracellular components and plasma membrane removed. Scale 
bar, 250 nm. e, Model from d rotated 90° in the x axis. Examples of an 
extracellular microvesicle (i), tethered vesicle (ii) and plasma membrane 
projection (iii) are indicated. Dotted lines, section boundaries; scale bar, 

250 nm. f, Quantification of microvesicle production by TcrAND T cells in 
response to MCC-I-E* (100 molecules per jum”) or the weak agonist K99A-I- 
E* (100 molecules per tm’), or with additional co-stimulation provided by 
CD80 and CD48 on bilayers (200 molecules per tm” for both). Microvesicle 


then allowed us to assign molecular distributions to immunological syn- 
apse ultrastructure. This revealed that TCR fluorescence at the immuno- 
logical synapse (Fig. 1g, j) corresponded almost exclusively to extracellular 
microvesicles contained within the cavity at the immunological synapse 
centre (Fig. 1], n). The characteristic F-actin depletion at the immuno- 
logical synapse centre’’ (Fig. 1g, k) is also circumscribed by the plasma 
membrane boundary of the central cavity (Fig. 10), reflecting interruption 
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production by OTII T cells, in response to ovalbumin-I-A® pMHC ligand 
(100 molecules per um’), was also measured. n is indicated above bars. Results 
are pooled from two independent experiments. g-o, Optical-electron 
tomography correlation. g, Overlay of TCR (red) and F-actin (green) 
fluorescence in TIRFM images of the TcrAND T cell immunological synapse. 
Scale bar, 3 tim, which also applies to j and k. h, A three-dimensional model of 
the immunological synapse centre constructed from an electron tomogram 
taken parallel to the T-cell-bilayer interface in g. T-cell plasma membrane, 
green; microvesicles, orange. The model is overlaid on a slice of the tomogram, 
and shown separately in i (scale bar in 1, 500 nm, which also applies to h, i and 
m-o). j, k, Overlay of TCR (j) and F-actin (k) fluorescence from g, shown as 
a heat map, with the electron micrograph of the same T-cell-bilayer interface. 
Dashed boxes and asterisk in g, j and k indicate the orientation of the 
immunological synapse in relation to the tomogram and model overlays in 

h, i and 1-o. 1, m, Overlay of microvesicles modelled in i (white), scaled and 
aligned with the TCR fluorescence in j (1) and F-actin in k (m). n, 0, Overlay of 
the T cell plasma membrane is modelled in i (white) scaled and aligned with the 
TCR fluorescence from j (n) and F-actin from k (0). 


of cytoplasmic continuity at the immunological synapse centre. The 
annular ICAM-1-LFA-1 adhesive domain, which surrounds centrally 
accumulated TCRs (and associated MCC-I-E), is wholly contained 
within the plasma membrane border of the central cavity (Supplemen- 
tary Fig. 6), which effectively precludes membrane-tethered LFA-1 
from binding ICAM-1 at the immunological synapse centre. Our find- 
ing that TCRs are present in extracellular microvesicles also accounts 
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for the inability of centrally accumulated TCRs to exchange with TCRs 
elsewhere in the T cell (Supplementary Fig. 1a, b), and for the confined 
cytoplasmic diffusion observed at the immunological synapse centre 
(Supplementary Fig. 1c). 

After 30-45 min, TcrAND T cells break immunological synapse 
symmetry and resume motility”. Migrating T cells left behind 80% of 
TCRs in particles on the bilayer (Supplementary Figs 7a, c and 8a and 
Supplementary Video 6). Optical-electron microscopy correlation of 
TcrAND T cells resuming motility after immunological synapse for- 
mation confirmed that these particles were extracellular TCR-enriched 
microvesicles released from the immunological synapse centre (Fig. 2a, b). 
The microvesicles diffused apart, which allowed us to characterize their 
composition by indirect immunofluorescence labelling and TIRFM 
(Supplementary Fig. 8b). TCR-enriched microvesicles were devoid of 
tyrosine phosphorylation (Fig. 2c), consistent with termination of TCR 
signalling. TCR-enriched microvesicles clustered MCC-I-E', but not 
B2m-I-E*, on bilayers (Fig. 2d and Supplementary Figs 7d and 9), estab- 
lishing that TCRs were on the external surface of microvesicles and 
were capable of binding cognate pMHC. 

To investigate the mechanism by which microvesicles were produced, 
we next studied the role of the endosomal sorting complex required for 
transport (ESCRT)-I member TSG101, which was previously impli- 
cated in ubiquitin-dependent sorting of TCRs to the immunological 
synapse centre°. ESCRT-I proteins, such as TSG101, typically carry out 
the first in a series of steps leading to vesicle budding and fission’*. We 
suppressed TSG101 expression in TcrAND T cells with small interfer- 
ing RNA (siRNA), and treated cells with non-targeting small oligo-RNA 
(sRNA) as a control (Supplementary Fig. 10a). As expected, TcrAND T 
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Figure 2 | TCR-enriched microvesicles are post-signalling extracellular 
products of T-cell activation that retain pMHC-binding competence. 

a, b, Optical-electron microscopy correlation of sessile TcrAND T cells 
forming an immunological synapse in response to MCC-I-E* and ICAM-1 
(a) and resuming motility (b), showing the distributions of F-actin (green) and 
TCRs (red) in relation to released microvesicles deposited on the bilayer 
(arrowheads). Scale bar, 4 im. c, Quantification of phosphotyrosine (pY) 
fluorescence intensity in arbitrary units (AU) of released TCR-enriched 
microvesicles (V) and their originating cells (C) detected by indirect 
immunofluorescence labelling and TIRFM. d, Enrichment of fluorescently 
labelled cognate MCC-I-E* or non-binding B2m-I-E* by released TCR- 
enriched microvesicles. Fold pMHC enrichment is defined as: (pMHC 
fluorescence intensity colocalized with TCR-enriched microvesicles)/ 

(free bilayer pMHC fluorescence intensity). Error bars represent s.e.m. 


120 | NATURE | VOL 507 | 6 MARCH 2014 


cells, in which TSG101 expression was suppressed, failed to translocate 
TCRs, together with bound MCC-I-E‘, to the immunological synapse 
centre, whereas central TCR accumulation was unaffected by treat- 
ment with control sRNA (Supplementary Fig. 10b, c). Optical-electron 
microscopy correlation revealed that suppression of TSG101 resulted in 
accumulation of TCRs, together with bound MCC-I-E‘ in a ring around 
the central cavity defining the immunological synapse centre (Fig. 3a), 
whereas treatment with control sRNA had no effect on TCR incorpora- 
tion into microvesicles (Supplementary Fig. 10d). Suppression of TSG101 
reduced, but did not abolish, microvesicle production (Supplementary 
Fig. 11). Notably, the remaining microvesicles present at the immuno- 
logical synapse centre were devoid of TCRs, demonstrating that TSG101 
was necessary for sorting of TCRs into microvesicles (Fig. 3a). 

We next investigated the role of the ESCRT-III complex, which is 
responsible for the final steps in vesicle fission. We used primary human 
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Figure 3 | Biogenesis of TCR-enriched microvesicles is mediated by ESCRT 
proteins and antagonized by HIV Gag. a, Optical-electron microscopy 
correlation of TCR and MCC-I-E* distributions of a representative TcrAND 
T cell following siRNA-mediated TSG101 suppression. Scale bar, 3 [m. 

b, Electron micrographs of the T-cell-bilayer interface of human CD4* T cells 
forming an immunological synapse in response to TCR engagement. Cells were 
transfected with a construct encoding dominant-negative VPS4 fused to GFP 
(VPS4dn-GFP) or GFP only. Red boxes and arrowheads indicate regions 
imaged at higher magnification in panels on the right. Arrowheads in c indicate 
nascent vesicles tethered to the plasma membrane. d, Quantification of limiting 
membrane-tethered microvesicles expressed as a percentage of total 
microvesicles within central compartments of the immunological synapse in 
cells expressing indicated constructs. Means and s.d. are shown, P value is for 
Student’s t-test. n is indicated above data bars. Results are pooled from two 
independent experiments. e-g, Representative optical-electron microscopy 
correlation of the immunological synapse in human CD4* T cells transfected 
with a construct encoding HIV Gag fused to GFP (Gag-GFP, green). Red, TCR; 
scale bar, 3 um. e, T cell without central Gag-GFP accumulation resuming 
migration and releasing TCR-enriched microvesicles (arrowhead). f, T cell 
forming an immunological synapse with centrally accumulated Gag—GFP. 

g, T cell with centrally accumulated Gag-GFP resuming motility and releasing 
Gag-GFP-containing microvesicles (arrowheads). h, Higher magnification 
image of boxed region in g showing internal juxta-membrane density in 
Gag-containing microvesicles. Arrowhead, plasma membrane; scale bar, 

500 nm. 
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CD4* T cells which, like TercAND murine T cells, produce TCR-enriched 
microvesicles at the immunological synapse centre (Supplementary 
Fig. 12). We transfected human CD4" T cells with a construct encoding 
a dominant-negative mutant of human VPS4(E228Q), that prevents ATP 
binding, fused to green fluorescent protein (VPS4dn-GFP)". Expres- 
sion of VPS4dn-GFP disrupts the function of endogenous VPS4”’, which 
binds to and catalytically disassembles'* membrane-associated ESCRT- 
III oligomers and is thought to be obligatory for sustaining membrane 
budding reactions!*"*. As a control, we transfected human CD4* T 
cells with a construct encoding GFP alone. Control T cells formed micro- 
vesicles at the immunological synapse centre, the majority of which 
were separated from the limiting plasma membrane of the central cavity 
(Fig. 3b, d). In marked contrast, the immunological synapse of VPS4dn- 
GFP-transfected T cells contained nascent microvesicles, within a frag- 
mented central compartment, which remained tethered to the limiting 
plasma membrane (Fig. 3b-d) at constricted bud necks (Fig. 3c). Consis- 
tent with the role of VPS4 ‘downstream’ of early acting ESCRTs, VPS4dn- 
GFP did not affect TCR sorting to the immunological synapse centre 
(Supplementary Fig. 13), although its distribution at the immunological 
synapse was altered when compared to wild-type VPS4 (Supplementary 


LETTER 


Figure 4 | TSG101 selectively controls TCR transfer to B cells that signal in 
response to pMHC engagement by microvesicle-tethered TCRs. a, Confocal 
microscopy of TcrAND T cells, treated with TSG101 suppressing siRNA 
(TSG siRNA) or control oligo-RNA (Control sRNA), forming conjugates with 
congenic splenic B cells loaded with fluorescent MCC peptide (cyan). 

T = 30 min.; red, TCR-C; arrowheads, transferred TCR and fluorescent MCC 
peptide; scale bar, 10 um. b. Quantification of TCR transfer to B cells in T-B 
conjugates. c. Quantification of MCC peptide transfer to T cells in T-B 
conjugates. NS, not significant. Means from three independent experiments are 
shown in b, c. Error bars represent s.d. 82m, non-stimulating control peptide. 
Con KD, control sRNA; TSG KD, knockdown with TSG101 siRNA. d, Live B 
cells loaded with calcium-sensitive fluorescent dye Fluo-4 were imaged by 
confocal microscopy on bilayers containing deposited TCR-enriched 
microvesicles (T = 30 min). Panels show differential interference contrast 
(DIC) and fluorescence images of TCR-enriched microvesicle patches on 
bilayers (red) and Fluo-4 fluorescence in B cells (green and heat map). Shown 
are examples of B cells interacting with TCR-enriched microvesicles (On), or 
on bilayer areas without detectable TCR-enriched microvesicles (Off). Scale 
bar, 10 uum. e, Quantification of Fluo-4 fluorescence intensity of B cells ‘On’ or 
‘Off TCR-enriched microvesicles. Data are representative of two experiments. 
f, Quantification of Fluo-4 intensity of B cells pulsed with MCC peptide or 
without antigen, after 30 min incubation on bilayers containing ICAM-1 alone, 
or with bilayers containing deposited TCR-enriched microvesicles. 
Fluorescence values were divided by the mean Fluo-4 intensity of B cells treated 
with 1 LM ionomycin at the end of the experiment. Data pooled from two 
independent experiments. Blue dots, fluorescence intensity in individual cells; 
red bars, sample mean; P value, one-way analysis of variance corrected for 

all comparisons. 


Fig. 14). These observations indicate that VPS4 is involved in fission of 
TCR-rich membrane buds, thereby producing TCR-enriched micro- 
vesicles at the immunological synapse centre’*"”. 

The ESCRT pathway is also exploited by the human immunodefi- 
ciency virus (HIV) structural polyprotein Gag*"* for membrane bud- 
ding and release of virus particles from the plasma membrane of infected 
cells. This prompted us to test whether Gag might interfere with the 
biogenesis of TCR-enriched microvesicles in human CD4* T cells. 
We expressed a construct encoding Gag fused to GFP (Gag-GFP) in 
primary human CD4* T cells, and analysed immunological synapse 
formation by optical-electron microscopy correlation. Transfected T 
cells adhered poorly to supported bilayers containing ICAM-1 alone, 
with small Gag-GFP puncta present throughout the contact interface 
(Supplementary Fig. 15b, c). Notably, TCR engagement led to robust 
recruitment of Gag-GFP to the contact interface (Supplementary 
Fig. 15b, c), where it accumulated at the immunological synapse centre, 
displaced TCRs (Fig. 3f and Supplementary Fig. 15d, e), and resulted in 
the release of Gag—GFP-containing microvesicles, which are analogous 
to virus-like particles (VLP) that are produced during HIV infec- 
tion (Fig. 3g, h and Supplementary Fig. 15f, g). Conversely, in T cells 
with low Gag-GFP expression, TCRs accumulated normally at the immu- 
nological synapse centre (Supplementary Fig. 15d, e), resulting in the 
release of TCR-enriched microvesicles (Fig. 3e and Supplementary 
Fig. 15f, g). The ESCRT-binding domain of Gag is required for its 
recruitment and central accumulation at the immunological synapse 
following TCR engagement (Supplementary Fig. 16 and Supplemen- 
tary Discussion). Taken together, these findings indicate that Gag antag- 
onizes ESCRT-dependent sorting of TCRs into microvesicles, and in 
this context TCR ligation directs the polarized budding of VLPs at the 
centre of the immunological synapse. 

To verify that microvesicles have a physiological role in cellular inter- 
actions, we next asked whether TcrAND T cells produce TCR-enriched 
microvesicles in conjugates with antigen-bearing B cells. As expected, 
TcrAND T cells readily accumulated TCRs at the immunological syn- 
apse centre in conjugates with MCC-pulsed congenic B10.Br B cells 
(Supplementary Fig. 17a, b). In approximately 40% of T-B cell conju- 
gates (Fig. 4a, b), small puncta of TCRs could be detected in B cells ‘distal’ 
to the immunological synapse, consistent with T cell to B cell synaptic 
transfer of TCR-containing microvesicles, whereas no T-cell polariza- 
tion or TCR transfer was detected in conjugated B cells pulsed with 
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B2m peptide (Supplementary Fig. 17a, b). Because TSG101 is critical 
for the production of TCR-enriched microvesicles in the planar bilayer 
model, we next investigated whether TCR transfer to live B cells is 
affected by siRNA-mediated suppression of TSG101°. In conjugates 
of TSG101-suppressed TcrAND T cells and MCC-pulsed B cells, transfer 
of TCRs was reduced by approximately 80% (Fig. 4a, b), relative to con- 
jugates treated with control sRNA, despite effective TCR polarization 
to the immunological synapse (Supplementary Fig. 17b). As previously 
described, transfer of pMHC from B cells to T cells was also observed 
(Fig. 4a, c and Supplementary Fig. 17a)'*"°. However, this pMHC 
transfer was unaffected by TSG101 silencing in the T cell (Fig. 4a, c), 
demonstrating that TSG101 selectively controls TCR transfer to B cells 
during bidirectional membrane exchange in T-B cell conjugates”. 

We next asked whether isolated microvesicles induce signalling in 
antigen-presenting B cells. Engagement of MHC II molecules on B 
cells triggers tyrosine phosphorylation and intracellular calcium signal- 
ling, by coupling to the B-cell antigen receptor signalling machinery”®, 
resulting in cell proliferation in primed B cells”'. We therefore asked 
whether B cells presenting cognate MCC-I-E* complexes are activated 
by specific TCRs present on microvesicles. We introduced B cells, loaded 
with the Ca**-sensitive dye Fluo-4, onto bilayers containing TCR- 
enriched microvesicles, from which T cells had been removed (Sup- 
plementary Fig. 17c), and monitored Ca”* signalling by confocal 
microscopy (Fig. 4d and Supplementary Fig. 17d). B cells pulsed with 
MCC were motile on bilayers, but stopped and showed sustained 
increase in intracellular Ca~* on encountering TCR-enriched micro- 
vesicle patches on bilayers (Fig. 4d, e). In contrast, B cells not loaded 
with MCC did not show an increase in Ca** relative to controls (that 
is, cells on bilayers with ICAM-1 alone, Fig. 4d, f). Thus, B-cell signalling 
by pMHC is initiated by cognate recognition of TCRs on microvesicles. 
In support of a role for TCR-enriched microvesicles during human 
T-B cell interaction, we found that Raji B cells, in superantigen-induced 
conjugates with human CD4" T cells”, received TCR puncta that acti- 
vate phospholipase Cy1 (PLCy1), a key mediator of intracellular Ca** 
signalling” (Supplementary Fig. 18). 

Our finding that the immunological synapse centre is an extracellular 
cavity, filled with TCR-enriched microvesicles by an ESCRT-dependent 
mechanism, provides a fundamentally new model for supramolecular 
domain organization at the immunological synapse (Supplementary 
Fig. 19a). Shedding of TCRs in microvesicles constitutes a novel mech- 
anism for TCR ‘downregulation’, following engagement by pMHC, 
that acts in parallel with receptor internalization™*. Our observations 
raise the possibility that other immune cells known to accumulate immun- 
oreceptors at the synapse centre, such as B*? and NK” cells, may also 
release them in microvesicles for intercellular communication. Cognate 
recognition of TCR-enriched microvesicles by pMHC on B cells may 
provide ‘help’ to B cells that is calibrated to the pMHC density present 
on their surface, pointing to a plausible mechanism for avidity-adjusted 
delivery of T-cell help to B cells’’. Finally, we find that TCR-enriched 
microvesicle biogenesis is a native pathway in T cells, triggered by anti- 
gen recognition, that may be co-opted by HIV proteins for polarized 
retroviral transmission at antigen-dependent immunological synapses 
(Supplementary Fig. 19b). 


METHODS SUMMARY 


Murine TcrAND T cells were expanded with MCC peptide and IL-2, and used once 
quiescent. Human CD4* T cells were isolated from peripheral blood by negative 
selection and used within 48 h. T cells clones were expanded for one cycle using 
heterologous irradiated peripheral blood mononuclear cells (PBMCs), phytohae- 
magglutinin (PHA) and IL-2 and used once quiescent. 1,2 dioleoyl-sn-glycero-3- 
phosphatidylcholine (DOPC) lipid bilayers containing 12.5 mol% 1,2-dioleoyl-sn- 
glycero-3-[(N-(5-amino-1-carboxypentyl)iminodiacetic acid)succinyl] (nickel salt) 
(DGS-NTA-Ni) were deposited on coverslips that were cleaned with peroxidated 
sulphuric acid and rinsed with pure water. Bilayers contained 100 molecules per jm” 
MCC-I-EX, and 200 molecules per jum? ICAM-1 unless otherwise stated. Optical 
imaging was performed using a Nikon TIRF microscope and an LSM510 confocal 
microscope. TEM imaging and tomography was performed using Phillips CM12, 
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CM200 and FEI Technai F20 microscopes. For optical-electron microscopy cor- 
relation, lipid bilayers were formed on coverslips containing chrome registration 
grids on which cells were deposited and fixed for imaging. Following TIRFM imag- 
ing of the entire grid, samples were embedded in Epon, and sectioned parallel to the 
bilayer. TEM images of the first parallel section contained the chrome grid imprint, 
allowing location, imaging and alignment of electron micrographs and tomograms 
with TIRFM images. Cells were transfected with DNA or RNA by nucleofection 
(Amaxa). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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SRA- and SET-domain-containing proteins link RNA 
polymerase V occupancy to DNA methylation 
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RNA-directed DNA methylation in Arabidopsis thaliana depends 
on the upstream synthesis of 24-nucleotide small interfering RNAs 
(siRNAs) by RNA POLYMERASE IV (Pol IV)? and downstream 
synthesis of non-coding transcripts by Pol V. Pol V transcripts are 
thought to interact with siRNAs which then recruit DOMAINS 
REARRANGED METHYLTRANSFERASE 2 (DRM2) to methylate 
DNA*’. The SU(VAR)3-9 homologues SUVH2 and SUVH9 act in 
this downstream step but the mechanism of their action is unknown*”. 
Here we show that genome-wide Pol V association with chromatin 
redundantly requires SUVH2 and SUVH9. Although SUVH2 and 
SUVH9 resemble histone methyltransferases, a crystal structure reveals 
that SUVH9 lacks a peptide-substrate binding cleft and lacks a prop- 
erly formed S-adenosyl methionine (SAM)-binding pocket neces- 
sary for normal catalysis, consistent with a lack of methyltransferase 
activity for these proteins*. SUVH2 and SUVH9 both contain SRA 
(SET- and RING-ASSOCIATED) domains capable of binding methy- 
lated DNA‘, suggesting that they function to recruit Pol V through 
DNA methylation. Consistent with this model, mutation of DNA 
METHYLTRANSFERASE 1 (MET1) causes loss of DNA methyla- 
tion, a nearly complete loss of Pol V at its normal locations, and 
redistribution of Pol V to sites that become hypermethylated. Further- 
more, tethering SUVH2 with a zinc finger to an unmethylated site is 
sufficient to recruit Pol V and establish DNA methylation and gene 
silencing. These results indicate that Pol V is recruited to DNA 
methylation through the methyl-DNA binding SUVH2 and SUVH9 
proteins, and our mechanistic findings suggest a means for selec- 
tively targeting regions of plant genomes for epigenetic silencing. 

To gain insights into the function of SUVH2/SUVH9, we solved the 
crystal structure of an amino-terminally truncated SUVH9 construct 
(residues 134-650), which contains all the known functional domains 
(the SRA, pre-SET and SET domains) (Fig. 1a, Extended Data 1a and 
Supplementary Table 1). The structure of SUVH9 is composed of three 
segments: a two-helix bundle towards the N terminus (residues 138-194), 
the SRA domain (residues 195-379) and the pre-SET/SET domains 
(residues 380-637). There are extensive inter-domain interactions that 
can stabilize the overall architecture of the protein (Fig. la and Extended 
Data Fig. 1b-g). 

The SRA domain of SUVH9 resembles those of UHRF1 and SUVH5"° * 
(Fig. 1b). Based on the SUVH5 SRA~mCHH (mCHH, methylated CHH, 
where H refers to A, T or C) DNA complex structure'’, we modelled a 
mCHH DNA into SUVH9 (Extended Data Fig. 2a). The DNA could be 
positioned in the nucleic-acid-binding cleft of the SRA domain without 
significant steric clashes and the proposed flipped-out 5-methylcytosine 
(5mC) base readily inserts into the binding pocket of the SRA domain. 

Although SUVH9 contains histone methyltransferase pre-SET and 
SET domains similar to Dim5, G9a and GLP'*"* (Extended Data Fig. 2b), 
it shows neither detectable histone methyltransferase activity nor bind- 
ing capacity for the SAM cofactor in vitro’. SUVH9 and SUVH2 lack 


post-SET domains that are normally critical for cofactor- and peptide- 
substrate-binding, as well as catalysis (Extended Data Fig. 3). Compared 
with the structure of human H3K9 methyltransferase GLP (Fig. 1c, d), the 
putative S-adenosylhomocysteine (SAH)-binding pocket and peptide- 
binding channel of SUVH9 are very open and incompletely formed 
(Fig. le, f) and cannot stably retain either a bound SAH molecule or the 
peptide substrate, especially in the absence of the stabilizing role of the 
post-SET domain. In summary, the SUVH9 structure is consistent with 
the demonstrated ability of SUVH9 to bind to methylated DNA, and 
supports the view that SUVH9 (and SUVH2) proteins encode inactive 
methyltransferase homologues. 

We sought to further characterize the role of SUVH2 and SUVH9 in 
the RNA-directed DNA methylation (RdDM) pathway by determin- 
ing their effect on siRNAs genome-wide. Most siRNAs are eliminated 
in Pol IV (mrpd1) mutants, whereas only some siRNAs are reduced in 
Pol V (nrpe1) mutants’ (Fig. 2a). We found that the suvh2 suvh9 double 
mutant reduced siRNA abundance at siRNA clusters that were depen- 
dent on both Pol IV and Pol V, but not at Pol V-independent clusters 
(Fig. 2a, b). Thus, the suvh2 suvh9 double mutant siRNA phenotype 
closely resembles that of a Pol V mutant. We next sought to determine 
whether SUVH2/SUVH9 might be involved in the production of non- 
coding transcripts by Pol V. At two characterized sites (IGN22 and 
P6)'*? we found that suvh2 suvh9 reduced Pol V transcripts to the same 
extent as nrpel (Fig. 2c). Similar results were observed at the AtSN1 
locus in suvh2 suvh9°. To test whether SUVH2/SUVH9 are required 
for Pol V chromatin occupancy, we used chromatin immunoprecipi- 
tation (ChIP) of a Flag-tagged NRPE1 (the largest subunit of Pol V). 
We observed only background levels of Pol V binding at IGN5 and 
IGN22 in suvh2 suvh9 compared to a sixfold enrichment in wild type 
(WT; Fig. 2d). We further analysed our ChIPs by next generation 
sequencing (ChIP-seq) and found that binding of Pol V at all prev- 
iously identified sites'* was significantly decreased or eliminated in 
suvh2 suvh9 (Fig. 2e, f). 

To determine the effect of SUVH2/SUVH9 on DNA methylation at 
defined Pol V binding sites, we used whole-genome bisulphite sequen- 
cing (BS-seq). As in nrpel, CHH methylation at Pol V binding sites 
was eliminated in suvh2 suvh9 (Fig. 3a). We also analysed BS-seq of the 
single mutants suvh2 and suvh9 to determine whether SUVH2/SUVH9 
act redundantly at all sites or have non-overlapping sites where they 
function. We found that suvh2 had a stronger effect than suvh9 at Pol 
V sites as well as at differentially methylated regions (DMRs) defined 
in either suvh2 or suvh9 single mutants, or in the suvh2 suvh9 double 
mutant (Extended Data Fig. 4a—c). These results indicate that SUVH2 
and SUVH9 act redundantly throughout the genome to control RADM. 

Our results suggest that a reinforcing loop exists between DNA methy- 
lation and Pol V binding via SUVH2/SUVH9. To further test this model 
we used a mutation in the maintenance methyltransferase MET1 that 
eliminates CG methylation genome-wide and also reduces CHG and 
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CHH methylation” ” (Fig. 3b). Using endogenous antibodies to NRPE1, 
ChIP-seq revealed that Pol V occupancy was virtually eliminated in 
met] at sites normally occupied by Pol V (Fig. 3c and Extended Data 
Figs 5-7). By contrast, at sites previously identified as gaining methyla- 
tion in met1*', we observed an increase in Pol V binding (Fig. 3d, e and 
Extended Data Figs 8 and 9). Furthermore, point mutations in the SRA 
domains of SUVH2 and SUVH9 were shown to cause a loss of RDM’. 
Together, these results indicate that SUVH2/SUVH9 binding to methy- 
lated DNA recruits Pol V, thus providing a link between pre-existing 
DNA methylation and the recruitment of further methylation by RADM. 
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Figure 1 | Crystal structure of SUVH9. a, Ribbon diagram of the SUVH9 
crystal structure containing a two-helix bundle, SRA domain, pre-SET domain 
and SET domain coloured in pink, green, orange and blue, respectively. The 
Zn3Cyss cluster is highlighted in a ball-and-stick representation and disordered 
regions are shown with dashed lines. b, A superposition of SUVH9 SRA 
domain (in green) and SUVH5 SRA domain (in silver) shows that both proteins 
adopt a similar fold. c, Top, crystal structure of human GLP in complex with 
bound SAH (PDB code 2IGQ) in a silver ribbon representation. Bottom, the 
SAH binding site in an electrostatic surface representation. The cofactor SAH is 
shown in a space-filling representation. d, Top, the crystal structure of human 
GLP in complex with SAH and H3K9me2 peptide (PDB code 2RFI) in silver 
ribbon representation. Bottom, the peptide binding site in an electrostatic 
surface representation. The post-SET domain and the acidic loop of the SET 
domain involved in peptide-substrate binding are highlighted in cyan and dark 
blue, respectively. The bound peptide is shown in a space-filling representation 
in both panels. e, Top, the crystal structure of SUVH9 in the free state in a 
colour-coded ribbon representation. Bottom, an expanded view of the putative 
SAH binding site in an electrostatic surface representation. f, Top, the crystal 
structure of SUVH9 in the free state in a colour-coded ribbon representation. 
Bottom, an expanded view of the putative peptide-binding site in an 
electrostatic surface representation. The long insertion loop of the SET 
domain is highlighted in magenta. 


To test directly whether SUVH2/SUVH9 may be sufficient to recruit 
Pol V, we used a zinc finger (ZF) to tether SUVH2 to an unmethylated 
epiallele of FWA, fwa-4. FWA is normally silenced owing to DNA 
methylation of tandem repeats in its promoter*’. FWA epialleles have 
lost this methylation, leading to ectopic expression of FWA and a 
heritable late flowering phenotype”. Surprisingly, siRNAs are still 
observed in fwa epialleles, but are unable to direct DNA methylation”. 
We proposed that Pol V transcripts may be missing in fwa epialleles 
and that recruitment of Pol V by SUVH2 might therefore stimulate 
FWA methylation and silencing. To test this we transformed a ZF- 
SUVH2 fusion protein construct into the fwa-4 epiallele (Extended 
Data Fig. 10a). As negative controls, we also transformed a haemag- 
glutinin (HA)-tagged SUVH2 line without the ZF (HA-SUVH2) as 
well as a construct in which the ZF was fused to KRYPTONITE (KYP/ 
SUVH4) (a SUVH protein not required for RADM). 

Approximately 75% of the fwa-4 plants transformed with the ZF- 
SUVH2 (T1 generation) flowered early as compared to the parental 
fwa-4 line, suggesting silencing of FWA (Fig. 4a). The fwa-4 line trans- 
formed with HA-SUVH2 or ZF-KYP flowered at the same time as the 
fwa-4 parent, showing that the effect was specific to the ZF-SUVH2 
fusion. Flowering time was measured in the T2 generation confirming 
these observations (Fig. 4b). The presence of the control ZF-KYP at 
FWA was shown by ChIP (Extended Data Fig. 10b); however, we were 
unable to detect the ZF-SUVH2, most likely owing to its instability or 
low abundance. 

We next used bisulphite sequencing to determine whether FWA gene 
silencing was associated with DNA methylation. In wild type, DNA 
methylation was detected over a large region, whereas in both fwa-4 
and transformants with ZF-KYP or HA-SUVH2, this region was devoid 
of DNA methylation (see ZF-KYP; Fig. 4c). In three independent ZF- 
SUVH2 T1 lines, DNA methylation was observed immediately around 
the Zn finger binding sites in all cytosine sequence contexts (Extended 
Data Fig. 10c). We analysed one of the lines (ZF-SUVH2-2) in the T2 
and T3 generations using BS-seq and found that methylation extended 
approximately 150 base pairs in either direction from the binding sites 
and did not expand significantly between generations (Fig. 4c). FWA 
methylation and gene silencing were maintained in T2 plants that had 
segregated away the ZF-SUVH2 transgene (Extended Data Fig. 10d), 
indicating that targeting by SUVH2 is capable of inducing DNA methy- 
lation and gene silencing that can be maintained in the absence of the 
initial trigger. 

To determine whether Pol V was present at FWA, we used NRPE1 
ChIP. As expected, we found enrichment of Pol V at two known Pol V 
sites, IGN5 and IGN22, in the wild-type and fwa-4 lines, but not in 
nrpel mutant plants (Fig. 4d). At FWA, we found enrichment of Pol V 
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Figure 2 | SUVH2 and SUVH9 are required for Pol V-dependent siRNA 
production, chromatin binding and transcription. a, Boxplot (whiskers 
extend to + 1.5 interquartile range (IQR)) of reads per kilobase per million 
reads (RPKM) values for 24-nucleotide siRNAs at previously defined siRNA 
clusters dependent on Pol IV (NRPD1), but not Pol V (NRPE1). 24-nt 
counts allow for up to 100 identical reads to be counted at any given position. 
* indicates a significant decrease (P< 2.2 X 101°, Mann-Whitney U-test). 
b, Similar to plot in a for clusters defined as dependent on Pol IV and Pol V. 
c, Quantitative PCR with reverse transcription (RT-qPCR) of IGN22 and P6 
relative to ACTIN7 and normalized to Columbia-0 (WT). Mean + standard 
deviation (s.d.) of two biological replicas. d, Quantitative PCR (qPCR) of IGN22 
and IGNS from Flag ChIP shown as enrichment of IP/input relative to ACTIN7 
in NRPE1-Flag/WT and NRPE1-Flag/suvh2 suvh9 lines. Mean ~ s.d. of two 
biological replicas. e, Heat map of NRPE1 enrichment at defined NRPE1 sites 
determined by Flag ChIP-seq in either NRPE1—Flag/WT or NRPE1-Flag/ 
suvh2 suvh9, with Flag ChIP in WT as negative control. f, Box plot of NRPE1 
enrichment at sites shown in Fig. 2e for NRPE1-Flag/WT and NRPE1-Flag/ 
suvh2 suvh9. 


in the wild type at both the promoter and transcript regions, but not in 
nrpel or fwa-4 (Fig. 4d). However, in the ZF-SUVH2-transformed 
fwa-4 plants we could now see enrichment of Pol V at FWA (Fig. 4d), 
indicating that SUVH2 is sufficient to recruit Pol V. 

To look for a direct interaction between SUVH2/SUVH9 and Pol V, 
we queried several IP-mass spectrometric data sets from purifications of 
NRPE1-Flag, but failed to find any SUVH2/SUVH9 peptides. However, 
we did identify SUVH2 peptides in two independent mass spectrometric 
data sets from DRD1 purifications® (Supplementary Table 2). DRD1 is 
a component of the DDR complex (also containing DMS3 and RDM1) 
which interacts with Pol V° and is required for Pol V occupancy through- 
out the genome’®. The number of SUVH2 peptides observed was lower 
than those from the DMS3 and RDM1 proteins and also lower than the 
level of peptides of most Pol V complex components, indicating that 
the interaction between SUVH2 and DRD1 is weaker or more transient 
than the interaction between the DDR components or between DDR and 
Pol V. To confirm the interaction, we performed co-immunoprecipitation 
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Figure 3 | Pol V binding is dependent on DNA methylation. a, Metaplot of 
per cent CHH methylation at all defined NRPE] binding sites as determined by 
BS-seq in wild type (WT), nrpel and suvh2 suvh9. b, Box plots showing DNA 
methylation in each cytosine context at defined NRPE1 binding sites in WT 
and met1. c, Pol V occupancy in metl is reduced at NRPE1 sites. d, Metaplot 
of DNA methylation at sites defined as hyper methylated in met. e, Pol V 
occupancy in met1 is increased at defined hyper methylated sites. 


experiments with HA-tagged SUVH2 and Flag-tagged DRD1 in trans- 
genic Arabidopsis plants and were able to detect HA-SUVH2 ina Flag— 
DRD!1 pull-down (Fig. 4e). We were also able to detect Flag-DRD1 ina 
pull-down using HA~SUVH2 expressed in leaves of Nicotiana benthami- 
ana and purified on HA magnetic beads (Extended Data 10e). These 
results confirm the IP-mass spectrometry observations and are con- 
sistent with a model in which SUVH2 acts indirectly via a transient 
interaction with DRD1 to recruit Pol V. 

Because RdDM is triggered at genomic locations that synthesize both 
Pol IV-dependent siRNAs and Pol V-dependent non-coding tran- 
scripts, it is critical to understand the mechanisms that direct Pol IV 
and Pol V to chromatin. Furthermore, because RADM functions to silence 
transposons that do not necessarily share consensus DNA sequences, a 
strong prediction is that Pol IV and Pol V will use epigenetic informa- 
tion in their targeting. We propose that SUVH2/SUVH9 serve as crit- 
ical components of this targeting system by recruiting Pol V to DNA 
methylation. We recently showed that recruitment of Pol IV involves 
SHH1, a protein that binds to histone H3 lysine 9 methylation, a silen- 
cing mark also found at RdDM sites****. Therefore, the dual marks of 
DNA methylation and histone H3 lysine 9 methylation seem to act in 
self-reinforcing loop mechanisms to maintain Pol V and Pol IV at sites 
of RdDM. These mechanistic findings also suggest a means for selec- 
tively targeting DNA methylation by directing both siRNAs and Pol V 
to overlapping regions of genome, paving the way to chromatin engin- 
eering in crop plants. 

Similar to other components of RADM, SUVH2/SUVH9 are required 
for the establishment of de novo DNA methylation during transforma- 
tion of an unmethylated FWA gene into Arabidopsis*. An interesting 
question is how an originally unmethylated DNA is first recognized as 
an RdDM target if SUVH2/SUVH9 DNA methylation-mediated Pol V 
recruitment cannot initially occur. One possibility is that shortly after 
the transgene is integrated into the genome, a permissive chromatin 
environment may allow RNA Pol V (or another polymerase) to produce 
surveillance transcripts that might serve to initiate RADM through 
siRNA-mediated recruitment of DRM2. After this initial round of 
DNA methylation, SUVH2/SUVH9 could then bind methylated DNA 
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Figure 4 | Tethered SUVH2 recruits Pol V through DRD1, resulting in 
DNA methylation and a late-flowering phenotype. a, Plants grown side-by- 
side to illustrate early flowering of ZF-SUVH2 in fwa-4 (T2 plants) compared 
to fwa-4. b, Flowering time of Columbia-0 (WT), ZF-SUVH2 in fwa-4, 
ZF-KYP in fwa-4, HA-SUVH2 in fwa-4 and fwa-4. Flowering time was 
determined by counting all rosette and cauline leaves up until the terminal 
flower. The average leaf number and standard deviation from about 20-30 
plants were determined. Mean + s.d. c, Per cent methylation at each cytosine 
in the FWA repeat region as determined by BS-seq in T2 and T3 ZF-SUVH2/ 
fwa-4 plants compared to T2 ZF-KYP/fwa-4 (unmethylated) and WT 
(standard methylation pattern). ZF binding sites are shown in green and the 
FWA gene in blue. d, NRPE1 ChIP in WT (positive control), nrpel mutant 
(negative control), fwa-4 epiallele and ZF-SUVH2/fwa-4. qPCR results of two 
well-characterized NRPE1 binding sites (IGN5 and IGN22) and two regions in 
FWA (FWAp, promoter; FWAt, transcript) are shown as enrichment of IP/ 
input relative to negative control. Mean = s.d. of two biological replicas. 

e, Coimmunoprecipitation of HA-SUVH2 in Arabidopsis using Flag~-DRD1. 
Left, inputs from the two parental strains (expressing either HA-SUVH2 or 
Flag—-DRD1) and an F2 line expressing both HA-SUVH2 and Flag—-DRD1. 
Right, elution from Flag-magnetic beads. Top, HA western blots; bottom, 
Flag western blots. 


and recruit Pol V for subsequent rounds of transcription. In this way, 
a self-reinforcing loop would be established in which pre-existing DNA 
methylation is required for the maintenance of RNA-directed DNA 
methylation. 


METHODS SUMMARY 


All biological materials and extended methods are described in detail in the Methods. 
siRNA libraries were made from flowers of the indicated lines as described prev- 
iously”* and Pol V transcription assays were performed as described previously’. 
ChIP-seq, BS-seq, co-immunoprecipitation, protein purification and crystalliza- 
tion, and data analysis are described in the Methods. Supplementary Table 1 provides 
crystallography data, Supplementary Table 2 provides IP-mass spectrometry data 
and Supplementary Table 3 provides primers used in this study. 
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Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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At a crossroads 


China is luring back expatriates with generous incentives, 
causing many to weigh the pros and cons of returning. 


BY QUIRIN SCHIERMEIER 


year ago, Jun Yao was facing a 
Ai dilemma. Having completed a 

PhD and two postdoctoral stints in the 
United States, the Chinese-born neuroscientist 
was well placed to pursue an academic career 
in his adopted country. But recruiters in China 
had also taken note of his achievements and 
were offering him substantial incentives to 
continue his research back home. Should he 
stay or go? 

When Yao had left China in 2002, the United 
States had obvious advantages as a place for 
launching his career. He did a PhD at Pennsyl- 
vania State University (Penn State) in College 


Park, followed by postdocs at the University of 
Wisconsin-Madison and at the Salk Institute 
for Biological Studies in La Jolla, California. 
But he knew that the environment for science 
research, and especially funding, had nota- 
bly improved in his homeland over the past 
ten years. So much so, in fact, that Tsinghua 
University in Beijing — one of the nation’s 
top-tier institutions — was now able to offer 
him a position as principal investigator in 
neuroscience and a generous starting package. 
Based on what Yao had seen of offers in the 
United States, the deal was exceptional. 

Yao mulled over the best way forward, and 
in the end he opted to return. (Soon after, a US 
university expressed strong interest. But Yao 
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declined, having made his decision.) During 
interviews for the job, he had been impressed by 
the high calibre and hard-working nature of the 
students. “I had not made up my mind before I 
went to Beijing for the interviews,” he says. “But 
when I saw the new labs and modern facilities 
they have built, and when I spoke with students 
and scientists at Tsinghua, I decided that this 
is the place for me to do science in the future” 

China is counting on Western-trained 
researchers such as Yao to help the nation boost 
its scientific status to a level similar to that of the 
United States, Europe and Japan. And it is fork- 
ing out the money to do so: in 2012, the nation 
spent 1.98% of its gross domestic product on 
research. That is slightly higher than Europe's 
allocation for research and, with the amount 
growing by approximately 20% a year, China's 
proportional spending on research edges ever 
closer to the 2.79% of the United States and the 
3.34% of Japan. 

But China's political environment some- 
times gives pause to expatriates who are 
accustomed to living and working in a demo- 
cratic society — the nation’s human-rights and 
freedom-of-speech issues worry some. Also of 
concern are the quality of science at some of 
China's lesser-known institutes and univer- 
sities; the level of expectation that returnees 
often face as a foreign-trained expert; and the 
fact that funding streams still often depend 
on connections to ‘old-boy networks’. “For 
Chinese scientists who have been abroad for 
years, it is difficult to be reintegrated into the 
system,” says Cong Cao, a sociologist and 
expert on Chinese science who is based at the 
University of Nottingham, UK. 


TALENT CONTEST 

To bring in the talent, China is embracing 
science and coming up with concrete plans to 
lure back promising expats. Its “Thousand Tal- 
ents’ plan — to which Yao successfully applied 
with assistance from the university — aims to 
entice foreign-trained professionals to dedi- 
cate their skills to building the government's 
vision for a twenty-first-century China, which, 
according to statements from vice-president 
Li Yuanchao, includes both wealth creation and 
technological exchange with other nations. As 
part of a special section of the Thousand Tal- 
ents plan, scientists, scholars and engineers 
under 40 years old who have at least three 
years of postdoctoral experience at a reputable 
non-Chinese university can apply for a posi- 
tion at a Chinese university or industry labora- 
tory. Successful applicants get grant money 
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> and relocation funds worth several million 
renminbi (depending on the research project); 
research projects typically run for three years. 
Candidates apply to the university or institute 
of their choice, and successful applicants then 
negotiate a salary and terms of employment 
with their host institute. Attracted by the pros- 
pect of lucrative funding and new career oppor- 
tunities, about 3,000 scientists have heeded the 
call since the scheme was established in 2008. 

By all indications, there will be plenty of 
applicants for years to come. China’ scientific 
diaspora numbers well over 400,000 scientists 
and other scholars. In the United States, for 
example, there is scarcely a research depart- 
ment that does not have Chinese-born PhD 
students and postdocs within its ranks. Many 
Chinese researchers leave their home country 
in the hope of finding permanent jobs — in 
or outside academia — and many do. But as 
funding opportunities in the United States 
and elsewhere decline, they are increasingly 
considering career opportunities back home. 

“Tve been in the United States for 20 years 
now and never has it been more difficult to 
secure grant money, especially for early-career 
scientists,” says Yao'’s former PhD supervisor 
Gong Chen, a neurobiologist at Penn State in 
University Park. “I do tell my students that it’s 
a good time to do science in China — but I 
also tell them about the downsides.” 


PROCEED WITH CAUTION 
Those downsides are not inconsiderable, and 
require careful study of any potential new 
workplace. “Go to China,” recommends Yao, 
“and look around before you apply for a posi- 
tion or sign a contract.” 

For instance, department heads and research 
administrators often have unrealistic expecta- 
tions about what repatriates can accomplish 
ina short time, says Chen. “There is immense 
pressure on returnees to instantly produce sci- 
entific results and publish in prestigious jour- 
nals,” he says. “One is right to expect hard work 
and determination from young scientists. But 
they need more time and support than they are 
often being granted.” 

Newcomers will typically need to start their 
research from scratch in more-or-less empty 
laboratories and with little knowledge of local 
circumstances. “Be social — good relation- 
ships with colleagues and superiors are essen- 
tial, after all” says Chen. “But do also prepare 
for difficulties and delays that you will inevita- 
bly face in a new environment, no matter how 
hard you are willing to work” 

One of the main concerns is where the 
position is based. Only a few Chinese institu- 
tions — among them Tsinghua University and 
Shanghai's Fudan and Jiao Tong universities — 
offer conditions that are comparable to those at 
top Western institutes. Researchers there can 
count on state-of-the-art lab equipment, top- 
quality collaborations with foreign institutions, 
adherence to the precepts of research integrity, 
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Expatriates might find it difficult to reintegrate into the Chinese system, says sociologist Cong Cao. 


the absence of old-boy-network practices and 
a high calibre of students, faculty and visitors. 
But at less-esteemed universities in smaller 
cities, the level of science is often second-rate, 
says David Zweig, chair of social sciences at the 
Hong Kong University of Science and Tech- 
nology. He suggests avoiding universities that 
allow professors to keep on their own PhD stu- 
dents routinely as they become postdocs and 
professors. “You only want to go to places that 
send students out to the world” 

Returnees should also seek out a mentor who 
is familiar with local circumstances and can help 
them to acclimatize and to deal with problems 
if they arise. “As an outsider, youd otherwise be 
pretty much lost? Zweig says. “You might not 
even be able to get any research money.” 

He and others say that at many institutes, 
getting funded or promoted often depends 
on forming personal relationships with local 
research administrators or party officials. 
Reports of plagiarism, theft of ideas and other 
fraudulent behaviour are common. All of this 
can add up to a situation in which Chinese 
scientists are more productive when they are 
abroad than when they are in China, says Rao 
Yi, a biologist formerly with Northwestern 
University’s medical school in Chicago, IIli- 
nois, who became dean of the School of Life 
Sciences at Peking University in Beijing in 
2007. “If they care about their research and 
integrity, those who might return should go 
for institutions that have reformed or are deter- 
mined to reform,” he says. 

Generous benefits to returnees often cause 
hong yan bing — ‘red-eye disease; as envy is 
often called in China — among other Chinese 
academics. The Thousand Talents plan makes 
no specific provision as to the terms of employ- 
ment of returnees, such as salary or period of 
employment; those terms differ depending on 
the host institution. But high-profile awardees 
can get instant tenure and a salary that is con- 
siderably higher than the 200,000 renminbi 
(US$32,650) per year of an average Chinese 
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professor. Such efforts, says Yi, sometimes cause 
Chinese professors to view senior returnees as 
competitors for funds and fame. Cao agrees: 
“There are often tensions between people 
with and without foreign education,’ he says. 
“Frankly, some do not want expats to return to 
China permanently” 

China expert Denis Simon, vice-provost of 
the Office of International Strategic Initiatives 
at Arizona State University in Phoenix, says 
that the problem is especially difficult when 
the recruit is not Chinese-born. Another 
government incentive — the ‘Recruitment 
Program of Foreign Experts’ — encourages sci- 
entists with a PhD from an overseas university 
to work in China for at least nine months a year 
over a minimum of three consecutive years, 
and it offers rewards that are similar to those 
of the Thousand Talents scheme. “It’s one thing 
to attract Chinese-born postgraduates from 
overseas, he says. “Bringing senior scientists 
to China is a different story. It’s hard to swal- 
low for China’ old guard that Mr Smith gets so 
much when their Mr Wang gets so much less.” 

For his part, Yao knows that challenges lie 
ahead, but he has found much to encourage 
him. He has been assigned a 180-square-metre 
laboratory on the Tsinghua campus on the out- 
skirts of Beijing — more research space than 
he ever had in the United States. It will take a 
good few months to buy the equipment and 
hire the technicians and postdocs he needs. To 
remain busy and productive, he maintains an 
active research alliance with the group he left 
behind in California. 


PREPARING TO RETURN 

Chinese researchers would be wise to establish 
a scientific reputation for themselves before 
they consider going back to their homeland, 
says Cao. “Don’t move too early in your career,” 
he says. “Make sure you have done at least a few 
years of productive postdoctoral research and 
produced some nice papers, else you'll have the 
air of one who is not competitive in the West.” 


The Thousand Talents plan also includes 
a popular short-term option for scientists 
who are reluctant to sever ties with the 
West altogether. The awardees who select 
this route keep their overseas academic jobs 
and obligations while working up to three 
months per year — usually during the sum- 
mer break — at a Chinese host institute. 

Those moving back should consider 
the scientific opportunities and family- 
friendly provisions of, for example, the 
Thousand Talents plan, which often 
includes premium medical service, free 
insurance and eligibility to purchase prop- 
erty at discount rates. But they should also 
be aware that a successful application is 
no guarantee of a permanent job in sci- 
ence. The tenure track to a permanent 
academic position — so familiar in the 
United States — is widely absent in China. 
In the past, says Chen, pretty much anyone 
who obtained a PhD could look forward 
to a job for life. But leading universities in 
Beijing and Shanghai have begun to evalu- 
ate junior faculty members’ performance 
after five years and to make promotions to 
permanent positions on the basis of those 
evaluations. 

The Chinese government — always 
struggling to reconcile communist ideals 
with the transition to a market economy 
— is aware of the grumbling within its 
academic insti- 
tutions. But the 
Communist Party 
has made it clear 
that bringing back 
the best is crucial 
to the nation’s 
economic devel- 
opment. During 
his 2007-12 term 
as head of the 
i Communist Par- 
ty’s organization 


“Tdo tellmy 7 department, Li 
students thatit’s —Yyanchao stressed 
agoodtimetodo that bringing 
scienceinChina _ back expatriates 
— but Ialso tell was a “strategic 
them about the investment”. 

downsides.” Critics say, 
Gong Chen however, that if 


China really wants 
to bring back the best, it must overhaul its 
science culture and its academic institu- 
tions to embrace democratic pluralism 
and improve transparency. The idea is that 
more returnees from the West would not 
just help the development of Chinese sci- 
ence, but that of society as well. “The inflow 
of young science talent from abroad,” says 
Chen, “will help shape our nation” = 


Quirin Schiermeier is Nature’ Germany 
correspondent. 
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A powerful narrative 


Scientists should find engaging ways to present 
information to their target audience, says Yoshimi Rii. 
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episode on the first day of a teacher work- 

shop. Along with three other scientists, 
I gave talks to schoolteachers on the use of real- 
time scientific data in the classroom. Because 
we didn't want to talk down to our audience, 
we kept to the standard scientific format for 
our presentations, with minor adjustments to 
specific jargon. As the day went on, however, it 
became apparent that we had missed the mark. 

“Why do scientists’ presentations all look the 
same?” said a teacher to me during the break. 
“Tt’s like you guys all get the same template.” Her 
words struck me like a 10-tonne anvil. She was 
right. Where was the creativity? 

Most of us at one time will have attended a 
talk about what should have been an engaging 
topic, only to find ourselves being lulled to sleep. 
Most likely, the speaker was using PowerPoint 
or Keynote. There are better approaches. 


| ast summer, I took part in an unfortunate 


MAKING THE DISCONNECTION 

The user-friendly nature of software such as 
PowerPoint allows anyone to make a presen- 
tation without much thought. Type in a few 
talking points, throw in some pictures and 
voila! Here’s my cookie-cutter talk. Never 
mind that there’s too much information on 
some slides — they're bulleted, so they must 
be easy to understand! The plot makes five or 
six points, but hey, it’s colourful and impres- 
sive, so that’s OK, right? And let’s not worry 
about those bumpy transitions, never mind 
the lack ofa coherent narrative. 

Reliance on bullet points and complicated 
graphs has caused many to become laxat apply- 
ing important performance skills. Disconnected 
from the public, scientists often forget to explain 
what to them is everyday lingo. Last September, 
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at a press conference at the University of Hawaii 
in Manoa for the Intergovernmental Panel on 
Climate Change Working Group I Report, 
scientists showed modelled rises in sea level 
from now until 2100. I watched the eyes of the 
reporters roll backwards in their sockets. 

Because slides are easy to recycle from one 
presentation to another, researchers often give a 
talk on autopilot using slides that they've shown 
hundreds of times before. If the presenter is 
disengaged, how can they expect the audience 
to listen? 


BREAKING BAD 

On the second day of the teacher workshop, we 
decided to scrap our prepared talks and start 
afresh. I placed a quote from Moby Dick in the 
title box and was rewarded with a smile from an 
English teacher in the audience. I then tossed a 
microbe-shaped stuffed toy to a teacher in the 
front row to open up a discussion about bacteria 
and phytoplankton. We asked them how they 
wanted to illustrate these concepts to their stu- 
dents. For the remainder of the workshop, the 
talks became less structured, more interactive 
and better appreciated. I vowed never to rely on 
cookie-cutter presentations again. 

Last October, I found myself at a foundation 
symposium with an audience of retired doc- 
tors, professors and other distinguished society 
members who had made donations in support 
of research at the university. I sat next to one 
donor, a 90-year-old veteran who was part of 
the highly decorated 442nd Regimental Com- 
bat Team in the Second World War. As part of 
the symposium, senior scientists and I were 
invited to give talks on exciting research being 
conducted at our university. I was nervous. 

But I found myself breathing a sigh of relief. 
In the first talk, a scientist studying the his- 
tory of calendars opened up a star-studded 
umbrella. Another scientist presented a beau- 
tiful slide show of mushroom pictures and riv- 
eted the audience with accounts of his hunts in 
unusual places. At the end of the symposium, 
the veteran next to me grabbed my hands and 
said, “I feel great knowing that my money 
has made all of this possible. Thank you” We 
relayed our message that day, and without the 
aid ofa single graph. m 


Yoshimi Rii is a graduate student in 
phytoplankton ecology at the University of 
Hawaii in Manoa. 
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Ua SCIENCE FICTION 


UNIT SIMULATION 


BY COY ST CLAIR 


he street was 
empty for as far as 
Jane could see, and 


she was resigned to another 
night without a customer. 
This would be three 
nights in a row. Yuri 
would not be happy. Jane 
pulled her coat closer 
against the evening chill 
and began walking. 

By the time she reached 
the end of her territory, 
three cars had passed 
without slowing down, 
even though she had 
looked hopefully at each 
one. It was insulting, 
really. She was a profes- 
sional, after all. She may 
have been dressed in nor- 
mal clothes, but there was 
no mistaking what she was, 
especially in this part of town. 
As she turned to make a slow 
walk down to the other end of her 
territory, a sedan slowed by the cor- 
ner. She approached confidently as the 
passenger window rolled down. 

“Looking for a good time, honey?” Jane 
said, leaning down to look inside the car. 
The man was not unattractive, with dark 
hair and nervous blue eyes. He nodded 
apprehensively. Jane smiled her warmest 
smile. She prided herself on her ability to 
reassure the customer, to make them feel 
comfortable and safe. The real trick of the 
trade, she knew, was to understand that 
safety and comfort were what they were 
really after. 

“Do you... you know, do the whole deal?” 
the man said, raising his eyebrows. 

“No, honey. If that’s what you're after you 
need to go down a few blocks. This is Unit 
Simulation only,’ she said, masking the dis- 
appointment. 

“No, no,’ he said, shaking his head quickly, 
“that’s what I want. Just the Simulation” 

She smiled again. “Sure thing, darling. 
Just head around the corner. There's a park- 
ing structure there. Just park up and meet me 


at the alley around the 
> NATURE.COM back.” 
Follow Futures: He nodded again 
© @NatureFutures and pulled away. 
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Reconnect with your past. 


the back of the building and waited. A few 
moments later, the man appeared from the 
structure, furtively glancing around to see if 
anyone was watching. He was a young pro- 
fessional type, dressed in a dark suit. Jane 
smiled a winsome smile. 

“Dont worry, darling. Nobody comes 
around here. We're safe,” she said. “My fee is 
one hundred for two hours.’ She paused with 
an expectant look for his assent. The man 
nodded. “Good. This way.” Jane took the 
young man’s hand and led him to a stairwell. 

Upstairs they entered the studio. There 
was a couch with matching rug, a fireplace 
that gently crackled, books on a coffee table 
— all the little details of a family home from 
a bygone era, down to the children’s toys 
that were casually strewn about. 

“Just take off your coat and sit down,” Jane 
said, indicating the couch. “Tl be right back” 

The man complied and sat silently as Jane 
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left the room. In moments she returned, 
wearing pyjama bottoms and a cotton 
robe. Two children held her hands as she 
approached the man. They were Synthetics, 
but if you didn’t look too closely it might 
not be obvious. Jane knew that 
her customers didn’t come here 
to examine things closely. They 
came to experience something 
that maybe their genes still 
remembered, and those 
types weren't interested in 
picking apart the details. 
The man smiled hesi- 
tantly and the children 
smiled back. They 
hugged his neck and 
giggled. When they 
released him and went 
to play quietly on the rug 
with their toys, Jane sat 
on the couch beside the 
man. She placed a gen- 
tle arm around him and 
drifted into thought as they 
placidly watched the ‘children’ 
play, just as she always did. 
There were advantages to the 
breeding programme, to be sure. 
Disease and deformity had nearly been 
eradicated, people were more intelligent, 
and the human race had finally begun to 
live in harmony with the planet, instead of 
at odds with it. But the price, Jane thought, 
was steep. Children, created and raised by 
the authorities, were perfect in every way, 
and perfectly sterile, just like Jane and her 
client. It had been generations since a true 
family had existed. Somewhere deep down, 
Jane knew, people felt that something was 
missing. The population was stable, meticu- 
lously regulated, but people could feel the 
absence of something lost. 

Jane was proud. Even after Family Unit 
Simulation had been outlawed as a ‘distur- 
bance of moral judgment and psychological 
well-being, Jane had found a way to con- 
tinue her trade. She knew what they really 
wanted, and she gave it to them, if only for 
two hours at a time. 

The man looked at Jane with wet eyes and 
she pulled his head to her shoulder. 

“There, there, darling. Everything is all 
right. You're home now: m 


Coy St Clair is a biologist and is presently 
putting the finishing touches on his first 
novel, The Exile of William Bend. 
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